( ,2)(19)(ca) Demande-Application 

one M/Mt^S CIPO 

Owe. Mi* Canadian Intellectual 2,251,262 

r \ Property Office (21) (Al) M.)**v*m 

intellectuelle du Canada . v ^ 1998/1 1/18 

(43) 1999/05/20 

(72) DIETRICH, Paul Shartzer, US 

(72) FISH, Linda Marie, US 

(72) KHARE, Reena, US 

(72) RABERT, Douglas Kenneth, US 

(72) SANGAMESWARAN, Lakshmi, US 

(7i) F HOFFMANN-LA ROCHE AG, CH 

(SDlntCl. 6 C12N 15/12, C07K 14/705, C12Q 1/68, G01N 33/68, C07K 16/28 
(30) 1997/1 1/20 (60/066,225) US cnmniTF TW 

( 54) CODAGE D'ACIDE NUCLEIQUE POUR CANAL SODIQUE DE 

TT^mTT NFTIVEUX 
(54) NUCLEIC ACID ENCODING A NERVOUS TISSUE SODIUM 

CHANNEL 



res: 



islant, sodium channel as a therapeutic target tor compounds. 



1*1 Industrie Canada Industry Canada 



ABSTRACT OF THE DISCLOSURE 

expression of these sequences, expression v ^ ^ ^ whose 

transformed with these expression vectors. UU> P ^ ^ 



compounds 



Ref. 20016 

1 

This invention relates generally to sodium channel proteins and more particularly to a 
novel nucleic acid sequence encoding for a mammalian a-subunit of a voltage-gated, 
preferably tetrodotoxin-resistant, nervous tissue sodium channel protein. This invention 
further relates to its production by recombinant technology. 

5 The basic unit of information transmitted from one part of the nervous system to 

another is a single action potential or nerve impulse. The ^transmission line" for these 
impulses is the axon, or nerve fiber. The electrical excitability of the nerve membrane has 
been shown to depend on the membrane's voltage-sensitive ionic permeability system that 
allows it to use energy stored in ionic concentration gradients. Electrical activity of the nerve 

10 is triggered by a depolarization of the membrane, which opens channels through the membrane 
that are highly selective for sodium ions, which are then driven inward by the electrochemical 
gradient. Of the many ionic channels, the voltage-gated or voltage-sensitive sodium channel is 
one of the most studied. It is a transmembrane protein that is essential for the generation of 
action potentials in excitable cells. An excellent review of sodium channels is presented in 

15 Catterall, TINS 16(12), 500-506 (1993). 

The cDNAs for several Na + channels have been cloned and sequenced. Numa et al, 
Annals of the New York Academy of Sciences 479, 338-355 (1986), describe cDNA from the 
electric organ of eel and two different ones from rat brain. Rogart, U.S. Patent No. 5,380,836, 
describes cDNA from rat cardiac tissue. See also Rogart et al, Proc. Natl. Acad. Sci. 86, 

20 8170-8174 (1989). The sequence of PN1 and its orthologs in humans (hNE) and rabbits 

(Na + s) have been published (see, for example, Klugbauer et a/., EMBOJ 14, 1084-1090 (1995) 
and Belcher et al t Proc. Natl. Acad. Sci. U.S.A. 923, 11034-11038 (1995)). The sequence of 
rat PN1 cloned from DRG and its function expression have been described (see, for example, 
Sangameswaran et al, J.Biol.Chem. 272, 14805-14809 (1997)). Other cloned sodium 

25 channels include rat brain types I and n, Noda et al, Nature 320, 188-192 (1986), Ha, Auld et 
aU Neuron 1, 449-461 (1988), and III, Kayano et al, FEBS Lett. 228, 187-194 (1988), rat 

11.9.98/Ar/vh 



skeletal muscle (SkMl), Trimmer et al, Neuron 3, 33-49 (1989), rat NaCh6, Schaller et al, J- 
Neurosci. 15, 3231-3242 (1995), rat peripheral nerve sodium channel type 3 (rPN3), 
Sangameswaran et al, J. Biol Chem. 271, 5953-5956 (1996), also called SNS, Akopian et a/., 
Nature 379, 257-262 (1996), rat atypical channel, Felipe et a/., J. Biol. Chem. 269, 30125- 
5 30131 (1994), and the rat glial sodium channel, Akopian et al, FEBS Lett. 400, 183-187 
(1997). 

These studies have shown that the amino acid sequence of the Na + channel has been 
conserved over a long evolutionary period. These studies have also revealed that the channel 
is a single polypeptide containing four internal repeats, or homologous domains (domains I- 

10 IV), having similar amino acid sequences. Each domain folds into six predicted and helical 
transmembrane segments: five are hydrophobic segments and one is highly charged with 
many positively charged lysine and arginine residues, This highly charged segment is the 
fourth transmembrane segment in each domain (the S4 segment) and is likely to be involved in 
voltage-gating. The positively charged side chains on the S4 segment are likely to be paired 

15 with the negatively charged side chains on the other five segments such that membrane 

depolarization could shift the position of one helix relative to the other, thereby opening the 
channel. Accessory subunits may modify the function of the channel. 

Therapeutic utility in recombinant materials derived from the DNA of the numerous 
sodium channels have been discovered. For example, U.S. Patent No. 5,132,296 by Cherksey 

20 discloses purified Na + channels that have proven useful as therapeutic and diagnostic tools. 

Isoforms of sodium channels are divided into „subfamilies" The term Jsoform" is 
used to mean distinct but closely related sodium channel proteins, i.e., those having an amino 
acid homology of approximately 60-80%. These also show strong homology in functions. 
The term „subfamilies" is used to mean distinct sodium channels that have an amino acid 

25 homology of approximately 80-95%. Combinations of several factors are used to determine 
the distinctions within a subfamily, for example, the speed of a channel, chromosomal 
location, expression data, homology to other channels within a species, and homology to a 
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channel of the same subfamily across species. Another consideration is an affinity to 
tetrodotoxin („TTX"). TTX is a highly potent toxin from the puffer or fugu fish which blocks 
the conduction of nerve impulses along axons and in excitable membranes of nerve fibers. 
TTX binds to the Na + channel and blocks the flow of sodium ions. 

Studies employing TTX as a probe have shed much light on the mechanism and 
structure of Na + channels. There are three Na + channel subtypes that are defined by the 
affinity for TTX, which can be measured by the IC 50 values: TTX-sensitive Na + channels (IC 50 
= 1-30 nM), TTX-insensitive Na + channels (IC 50 » 1-5 nM), and TTX-resistant Na + channels 
(IC 50 £50nM). 

TTX-insensitive action potentials were first studied in rat skeletal muscle (Redfem et 
aL, Acta Physiol. Scand. 82, 70-78 (1971)). Subsequently, these action potentials were 
described in other mammalian tissues, including newborn mammalian skeletal muscle, 
mammalian cardiac muscle, mouse dorsal root ganglion cells in vitro and in culture, cultured 
mammalian skeletal muscle and L6 cells. See Rogart, Ann. Rev. Physiol. 43, 711-725 (1980). 

Rat dorsal root ganglia neurons possess both TTX-sensitive (IC 50 - 0.3 nM) and TTX- 
resistant (IC 50 ~ 100 |xM) sodium channel currents, as described in Roy et aL, J. Neurosci. 12, 
2104-21 1 1 (1992). TTX-resistant sodium currents have also been measured in rat nodose and 
petrosal ganglia. See Dceda et ai, J. Neurophysiol. 55, 527-539 (1986) and Stea et al, 
Neurosci. 47, 727-736 (1992). Electrophysiologists believe that another TTX-resistant sodium 
channel is yet to be detected. 

Though cDNAs from rat skeletal muscle, heart and brain are known, identification and 
isolation of cDNA from peripheral sensory nerve tissue, such as dorsal root ganglia, has been 
hampered by the difficulty of working with such tissue. 

SUMMARY OF THE INVENTION 
The present invention provides novel purified and isolated nucleic acid sequences 
encoding mammalian, preferably TTX-resistant, nervous tissue sodium channel proteins that 



are strongly expressed in adult DRG and nodose ganglia, less strongly expressed in brain, 
spinal cord and superior cervical ganglia, and not expressed in sciatic nerve, heart or skeletal 
muscle. In presently preferred forms, novel DNA sequences comprise cDNA sequences 
encoding rat nervous tissue sodium channel protein. One aspect of the present invention is the 
5 a-subunit of this sodium channel protein. 

Disclosed is the DNA, cDNA, and mRNA derived from the nucleic acid sequences of 
the invention and the cRNA derived from the mRNA. Specifically, two cDNA sequences 
together encode for the full length rat nervous tissue sodium channel. 

Also included in this invention are alternate DNA forms, such as genomic DNA, DNA 
10 prepared by partial or total chemical synthesis from nucleotides, and DNA having deletions or 
mutations. 

Still another aspect of the invention is the novel rat TTX-resistant sodium channel 
protein and fragments thereof, encoded by the DNA of this invention. 

Another aspect of the present invention are recombinant polynucleotides and 
15 oligonucleotides comprising a nucleic acid sequence derived from the DNA sequence of this 
invention. 

Another aspect of the invention is a method of stabilizing the full length cDNA which 
encodes the protein sequence of the invention. 

Further aspects of the invention include expression vectors comprising the DNA of the 
20 invention, host cells transformed or transfected by these vectors, and a cDNA library of these 
host cells. 

Also forming part of this invention is an assay for inhibitors of the sodium channel 
protein comprising contacting a compound suspected of being an inhibitor with expressed 
sodium channel and measuring the activity of the sodium channel. 
25 Further provided is a method of inhibiting the activity of the TTX-resistant sodium 

channel comprising administering an effective amount of a compound having an IC 50 of 10 
uM or less. 



Additionally provided are methods of employing the DNA for forming monoclonal and 
polyclonal antibodies, for use as molecular targets for drug discovery, highly specific markers 
for specific antigens, detector molecules, diagnostic assays, and therapeutic uses, such as pain 
relief, a probe for the PN5 channel in other mammalian tissue, designing therapeutics and 
5 screening for therapies. 

BRIEF DESCRIPTION OF THE SEP ID'S AND FIGURES 
Figures 1 A-E depict the 5908 nucleotide cDNA native sequence encoding the rat 
sodium channel type 5 („PN5") (SEQ ID NO: 1), derived from two overlapping cDNA clones, 
designated 26.2 and 1.18. 
10 Figures 2A-F depict the deduced amino acid sequence of PN5 (SEQ ID NO: 2, 

represented in the three-letter amino acid code). Figures 2G-H, depicting the deduced amino 
acid sequence of PN5 in single letter amino acid code, also show the homologous domains (I- 
IV); the putative transmembrane segments (S1-S6); the amino acid conferring resistance to 
TTX (♦); N-glycosylation sites (•); cAMP-dependent protein kinase A (PKA) 
15 phosphorylation site (0); and the termination codon (*). 

Figure 3A depicts an 856 base pair sequence for the human PN5 (SEQ ID NO: 3). 
Figure 3B depicts the amino acid sequence comparison of the hPN5 fragment with rat PN5. 

Figure 4 depicts the sequence for the novel sodium channel domain IV probe (SEQ ID 

NO: 4). 

20 Figures 5A-E depict the 5334 nucleotide sequence modified for stability and 

expression (SEQ ID NO: 5). Nucleotides 24 to 5518 constitute the 5295 bp region coding for 
a 1765 amino acid protein. 

Figure 6 depicts the cloning map of PN5. 

25 DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to a purified and isolated nucleic acid sequence encoding 
for a novel mammalian, preferably TTX-resistant, sodium channel protein. The term "purified 
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and isolated DNA" refers to DNA that is essentially free, i.e. contains less than about 30%, 
preferably less than about 10%, and even more preferably less than about 1%, of the DNA 
with which the DNA of interest is naturally associated. Techniques for assessing purity are 
well known to the art and include, for example, restriction mapping, agarose gel 

5 electrophoresis, and CsCl gradient centrifugation. 

The term "DNA" is meant to include „cDNA", or complementary DNA, which is 
single-stranded or double-stranded DNA sequences made by reverse transcription of mRNA 
isolated from a donor cell or by chemical synthesis. For example, treatment of mRNA with a 
reverse transcriptase such as AMV reverse transcriptase or M-MuLV reverse transcriptase in 

10 the presence of an oligonucleotide primer will furnish an RNA-DNA duplex which can be 
treated with RNase H, DNA polymerase, and DNA ligase to generate double-stranded cDNA. 
If desired, the double-stranded cDNA can be denatured by conventional techniques such as 
heating to generate single-stranded cDNA. The term „cDNA" includes cDNA that is a 
complementary copy of the naturally occurring mRNA ,as well as complementary copies of 

15 variants of the naturally occurring mRNA that have the same biological activity. Variants 
would include, for example, insertions, deletions, sequences with degenerate codons and 
alleles. 

„cRNA" corresponding to mRNA transcribed from a DNA sequence encoding the cc- 
subunit of a novel, preferably TTX-resistant, sodium channel protein is contemplated by this 
20 invention. The term „cRNA" refers to RNA that is a copy of the mRNA transcribed by a cell. 
Specifically, the invention encompasses DNA having the native versions of the 
nucleotide sequences set forth in Figures 1A-E (SEQ ID NO: 1) designated herein as sodium 
channel type 5 (PN5). Figures 1A-E depict the 5908 nucleotide cDNA construct comprising a 
5298-base (counting the stop codon) open reading frame (SEQ ID NO:l). Nucleotide residue 
25 79 represents the start site of translation and residue 5376 represents the end of the stop codon. 
The invention also encompasses engineered versions of PN5, and specifically the 
version as set forth in Figures 5A-E (SEQ ID NO: 5). This 5334 nucleotide Sall-Xbal clone 
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lacks most of the untranslated sequences, the 5298 nucleotide open reading frame beginning at 
nucleotide 24 and ending at nucleotide 5321. The start and stop codons are underlined, as are 
the translationally stlent mutations at nucleotides 3932, 3935, 3941, 3944, and 3947, which 
were introduced to block rearrangement in this region during growth in E. Coli. 
5 The nucleotide sequence of SEQ ID NO: 1 (Figures 1 A-E) corresponds to the cDNAs 

from rat. A homology search provided that the closest related sodium channel is found in the 
rat cardiac channel, with 72.5% homology. The next closely related channels are rPNl, with 
72% and rat brain types I and ffl, with 71.8% and 71.3% respectively. Homology to rPN3a, 
hPN3, rPN4, rPN4a, rat brain type II and rat skeletal muscle are each approximately 70 to 
10 71%. 

Additionally, an 856 base pair clone (SEQ ID NO: 3) as shown in Figure 3A has been 
isolated from a human dorsal root gangha (DRG) „cDNA library" and is closely related to the 
rat PN5 amino acid sequence with 79% identity and 86% homology. The human PN5 
sequence spans the region between mSl and interdomain IIMV which includes the fast 
15 inactivation gate (i.e., IFM) that is located within interdomain III/IV. 

The term „cDNA library" refers to a collection of clones, usually in a bacteriophage, or 
less commonly in bacterial plasmids, containing cDNA copies of mRNA sequences derived 

from a donor cell or tissue. 

It is believed that additional homologs of the novel rat TTX-resistant sodium channel 

20 described herein are also expressed in other mammalian tissue. 

Northern blot analysis (Example 5) indicates that PN5 is encoded by a -6.5 kb 

transcript. 

The deduced amino acid sequence of PN5, shown in Figures 2A-F (SEQ TD NO: 2), 
exhibits the primary structural features of an cc-subunit of a voltage-gated, TTX-resistant 
25 sodium channel. Shown in Figures 2G-H are the homologous domains (HV); the putative 
transmembrane segments (S1-S6); the amino acid conferring resistance to TTX (♦); N- 
glycosylation sites (•); and cAMP-dependent PKA phosphorylation sites (0). DNA sequences 
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encoding the same or allelic variant or analog sodium channel protein polypeptides of the 
nervous system, through use of, at least in part, degenerate codons are also contemplated by 
this invention. 

An interesting feature of this deduced amino acid sequence is that the amino acid that 
5 is most responsible for TTX-sensitivity is located at position 355 and is not aromatic. In rat 
and human brain type sodium channels, skeletal muscle channel, and in PN1 and PN4, this 
amino acid is tyrosine or phenylalanine and these channels are all TTX-sensitive. In PN3 and 
PN5, the amino acid is a serine. Since PN3 is highly resistant to TTX, the implication is that 
PN5 is also a TTX-resistant channel. The cardiac channel has a cysteine at this position and is 

10 ..insensitive" to TTX. 

Although PN5 contains all of the hallmark features of a voltage-gated sodium channel, 
it has unique structural features that distinguish it from other sodium channels. For example, 
DHS4 has 5 basic amino acids conserved in all sodium channels that could play a significant 
role in the voltage sensing aspects of the channel function. In PN5, the first basic amino acid 

15 is replaced by an alanine. Similarly, in DIIIS4, PN5 has 5 basic amino acids rather than six 
that are present in other sodium channel sequences, the last arginine replaced by a glutamine. 
In DfflS3, the transmembrane segment contains only 18 amino acids, in contrast to 22 amino 
acids in other channels. Also, the short linker (4 amino acids) loop between S3 and S4 in DIE 
is even shorter by a .deletion' of 3 amino acids. This shortening of the S3 and the linker loop 

20 has been confirmed by designing primers in the appropriate region of the sequence for an RT- 
PCR experiment from rat DRG and sequencing the amplified DNA fragment. Such an 
experiment has been performed to confirm the sequence of another region of PN5, in the 
DIVS5-S6 loop, where there was a deletion of an 8 amino acid peptide. 

Reverse transcription-polymerase chain reaction (oligonucleotide-primed RT-PCR) 

25 tissue distribution analysis of RN A from the rat central and peripheral nervous systems, in 
particular from rat DRG, was performed. Eight main tissue types were screened for 
expression of the unique PN5 genes corresponding to positions 5651-5903 of SEQ ID NO: 1 
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(Figures 1A-E). PN5 mRNA was present in five of the tissues studied: brain, spinal cord, 
DRG, nodose ganglia, and superior cervical ganglia. PN5 was not present in the remaining 
tissues studied: sciatic nerve tissue, heart or skeletal muscle tissue. PN5 was found to be the 
strongest in DRG and nodose ganglia, leading the applicants to believe that the DRG is 
5 enriched with PN5. PN5 shows dramatic abundance differences across a range of tissues. 
PN5 has a gradient of expression with high expression in DRG. PN5 has a gradient of 
expression like other channels, but more limited distribution. 

The invention not only includes the entire protein expressed by the cDNA sequences of 
SEQ ID NOS: 1, 2 and 3, but also includes protein fragments. These fragments can be 
1 0 obtained by cleaving the full length proteins or by using smaller DNA sequences or 
..polynucleotides" to express the desired fragment. 

The term "polynucleotide" as used herein refers to a polymeric form of nucleotides of 
any length, either ribonucleotides or deoxyribonucleotides. This term refers only to the 
primary structure of the molecule. Thus, this term includes double- and single-stranded DNA, 
15 as well as double- and single-stranded RNA. It also includes modified, for example, by 
methylation and/or by capping, and unmodified forms of the polynucleotide. 

Further, the term "polynucleotide" is intended to include a recombinant polynucleotide, 
which is of genomic, cDNA, semisynthetic or synthetic origin which, by virtue of its origin or 
manipulation is not associated with all or a portion of the polynucleotide with which it is 
20 associated in nature and/or is linked to a polynucleotide other than that to which it is linked in 
nature. 

Accordingly, the invention also includes polynucleotides that can be used to make 
polypeptides of about 10 to 1500, preferably 10 to 100, amino acids in length. The isolation 
and purification of such recombinant polypeptides can be accomplished by techniques that are 
25 well known in the art, for example, preparative chromatographic separations or affinity 
chromatography. In addition, polypeptides can also be made by synthetic means which are 
well known in the art. 



The invention allows for the manipulation of genetic materials by recombinant 
technology to produce polypeptides that possess the structural and functional characteristics of 
the novel voltage-gated, TTX-resistant sodium channel a-subunit found in sensory nerves. 
Site directed mutagenesis can be used to provide such recombinant polypeptides. For 
5 example, synthetic oligonucleotides can be specifically inserted or substituted into the portion 
of the gene of interest to produce genes encoding for and expressing a specific mutant. 
Random degenerate oligonucleotides can also be inserted and phage display techniques can be 
used to identify and isolate polypeptides possessing a functional property of interest. 

In addition, the present invention contemplates recombinant polynucleotides of about 
10 15 to 20kb, preferably 10 to 15kb, nucleotides in length, comprising a nucleic acid sequence 
„derived from" the DNA of the invention. 

The term "derived from" a designated sequence, refers to a nucleic acid sequence that 
is comprised of a sequence of approximately at least 6 to 8 nucleotides, more preferably at 
least 10 to 12 nucleotides, and, even more preferably, at least 15 to 20 nucleotides that 
15 correspond to, i.e., are homologous or complementary to, a region of the designated sequence. 
The derived sequence is not necessarily physically derived from the nucleotide sequence 
shown, but may be derived in any manner, including for example, chemical synthesis or DNA 
replication or reverse transcription, which are based on the information provided by the 
sequences of bases in the region(s) from which the polynucleotide is derived. 
20 A neonatal expression test was performed with Fl 1, a fusion cell line designed from 

neonatal rat DRG fused with a mouse cell line, N18TG, from Massachusetts General Hospital. 
Fl 1 responds to trophic agents, such as NGF, by extending dendrites. It was found that PN5 
was present in both native Fl 1 and Fl 1 treated with NGF, leading the applicants to believe 
that the sodium channel is natively expressed in Fl 1. 
25 In situ hybridization of PN5 mRNA to rat DRG tissue provides localization 

predominantly in the small and medium neurons with no detection in large neurons. 
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PN5 was also mapped to its cytogenetic location on mouse chromosome preparations. 
PN5 maps to the same chromosome as the cardiac channel and PN3. 

In general, sodium channels comprise an a- and two 6-subunits. The B-subunits may 
modulate the function of the channel. However, since the a-subunit is all that is required for 

5 the channel to be fully functional, expression of the cDNA in SEQ ID NO: 1 (Figures 1 A-E) 
will provide a fully functional protein. The gene encoding the B r subunit in peripheral nerve 
tissue was found to be identical to that found in rat heart, brain and skeletal muscle. The 
cDNA of the B r subunit is not described herein as it is well known in the art, see Isom ex al, 
Neuron 12, 1183-1 194 (1994). However, it is to be understood that by combining the known 

10 sequence for the B,-subunit with the a-subunit sequence described herein, one may obtain 
complete PN5 voltage-gated, preferably TTX-resistant, sodium channel. 

The present invention also includes expression vectors" comprising the DNA or the 
cDNA described above, host cells transformed with these expression vectors capable of 
producing the sodium channel of the invention, and cDNA libraries comprising such host 

15 cells. 

The term "expression vector" refers to any genetic element, e.g., a plasmid, a 
chromosome, a virus, behaving either as an autonomous unit of polynucleotide expression 
within a cell or being rendered capable of replication by insertion into a host cell chromosome, 
having attached to it another polynucleotide segment, so as to bring about the replication 

20 and/or expression of the attached segment. Suitable vectors include, but are not limited to, 
plasmids, bacteriophages, and cosmids. Vectors will contain polynucleotide sequences which 
are necessary to effect ligation or insertion of the vector into a desired host cell and to effect 
the expression of the attached segment. Such sequences differ depending on the host 
organism, and will include promoter sequences to effect transcription, enhancer sequences to 

25 increase transcription, ribosomal binding site sequences and transcription and translation 
termination sequences. 
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The term "host cell" generally refers to prokaryotic or eukaryotic organisms and 
includes any transformable or transferable organism which is capable of expressing a protein 
and can be, or has been, used as a recipient for expression vectors or other transferred DNA. 
Host cells can also be made to express protein by direct injection with exogenous cRNA 
translatable into the protein of interest. A preferred host cell is the Xenopus oocyte. 

The term "transformed" refers to any known method for the insertion of foreign DNA 
or RNA sequences into a host prokaryotic cell. The term „transfected" refers to any known 
method for the insertion of foreign DNA or RNA sequences into a host eukaryotic cell. Such 
transformed or transfected cells include stably transformed or transfected cells in which the 
inserted DNA is rendered capable of replication in the host cell. They also include transiently 
expressing cells which express the inserted DNA or RNA for limited periods of time. The 
transformation or transfection procedure depends on the host cell being transformed. It can 
include packaging the polynucleotide in a virus as well as direct uptake of the polynucleotide, 
such as, for example, lipofection or microinjection. Transformation and transfection can result 
in incorporation of the inserted DNA into the genome of the host cell or the maintenance of 
the inserted DNA within the host cell in plasmid form. Methods of transformation are well 
known in the art and include, but are not limited to, viral infection, electroporation, 
lipofection, and calcium phosphate mediated direct uptake. 

It is to be understood that this invention is intended to include other forms of 
expression vectors, host cells, and transformation techniques which serve equivalent functions 
and which become known to the art hereto. 

The invention also pertains to an assay for inhibitors of the novel TTX-resistant 
sodium channel protein comprising contacting a compound suspected of being an inhibitor 
with expressed sodium channel and measuring the activity of the sodium channel. The 
compound can be a substantially pure compound of synthetic origin combined in an aqueous 
medium, or the compound can be a naturally occurring material such that the assay medium is 
an extract of biological origin, such as, for example, a plant, animal, or microbial cell extract. 

12 



PN5 activity can be measured by methods such as electrophysiology (two electrode voltage 
clamp or single electrode whole cell patch clamp), guanidinium ion flux assays, and toxin- 
binding assays. An "inhibitor" is defined as generally that amount that results in greater than 
50% decrease in PN5 activity, preferably greater than 70% decrease in PN5 activity, more 
5 preferably greater than 90% decrease in PN5 activity. 

Many uses of the invention exist, a few of which are described below: 
1. Probe for mamalian channels. 

As mentioned above, it is believed that additional homologs of the novel rat TTX- 
resistant sodium channel described herein are also expressed in mammalian tissue, in 
10 particular, human tissue. The entire cDNAs of PN5 rat sodium channels of the present 
invention can be used as a probe to discover whether additional novel PN5 voltage-gated, 
preferably TTX-resistant, sodium channels exist in human tissue and, if they do, to aid in 
isolating the cDNAs for the human protein. 

The human homologues of the rat TTX-resistant PN5 channels can be cloned using a 
15 human DRG cDNA library. Human DRG are obtained at autopsy. The frozen tissue is 
homogenized and the RNA extracted with guanidine isothiocyanate (Chirgwin et ai 
Biochemistry 18, 5294-5299, (1979)). The RNA is size-fractionated on a sucrose gradient to 
enrich for large mRNAs because the sodium channel a-subunits are encoded by large 
(7-11 kb) transcripts. Double-stranded cDNA is prepared using the Superscript Choice cDNA 
20 kit (GIBCO BRL) with either oligo(dT) or random hexamer primers. EcoRI adapters are 
ligated onto the double-stranded cDNA which is then phosphorylated. The cDNA library is 
constructed by ligating the double-stranded cDNA into the bacteriophage-lambda ZAP H 
vector (Stratagene) followed by packaging into phage particles. 

Phage are plated out on 150 mm plates on a lawn of XLI-Blue MRF bacteria 
25 (Stratagene) and plaque replicas are made on Hybond N nylon membranes (Amersham). 
Filters are hybridized to rat PN5 cDNA probes by standard procedures and detected by 
autoradiography or chemiluminescence. The signal produced by the rat PN5 probes 
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hybridizing to positive human clones at high stringency should be stronger than obtained with 
rat brain sodium channel probes hybridizing to these clones. Positive plaques are further 
purified by limiting dilution and re-screened by hybridization or PCR. Restriction mapping 
and polymerase chain reaction will identify overlapping clones that can be assembled by 
5 standard techniques into the full-length human homologue of rat PN5. The human clone can 
be expressed by injecting cRNA transcribed in vitro from the full-length cDNA clone into 
Xenopus oocytes, or by transfecting a mammalian cell line with a vector containing the cDNA 
linked to a suitable promoter. 

2. Antibodies Against PN5. 
1 0 The polypeptides of the invention are highly useful for the development of antibodies 

against PN5. Such antibodies can be used in affinity chromatography to purify recombinant 
sodium channel proteins or polypeptides, or they can be used as a research tool. For example, 
antibodies bound to a reporter molecule can be used in histochemical staining techniques to 
identify other tissues and cell types where PN5 are present, or they can be used to identify 
1 5 epitopic or functional regions of the sodium channel protein of the invention. 

The antibodies can be monoclonal or polyclonal and can be prepared by techniques that 
are well known in the art. Polyclonal antibodies are prepared as follows: an immunogenic 
conjugate comprising PN5 or a fragment thereof, optionally linked to a carrier protein, is used 
to immunize a selected mammal such as a mouse, rabbit, goat, etc. Serum from the 
20 immunized mammal is collected and treated according to known procedures to separate the 
immunoglobulin fraction. 

Monoclonal antibodies are prepared by standard hybridoma cell technology based on 
that reported by Kohler and Milstein in Nature 256, 495-497 (1975). Spleen cells are obtained 
from a host animal immunized with the PN5 protein or a fragment thereof, optionally linked to 
25 a carrier. Hybrid cells are formed by fusing these spleen cells with an appropriate myeloma 
cell line and cultured. The antibodies produced by the hybrid cells are screened for their 
ability to bind to expressed PN5 proteins. 
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A number of screening techniques well known in the art, such as, for example, forward 
or reverse enzyme-linked immunosorbent assay screening methods, may be employed. The 
hybrid cells producing such antibodies are then subjected to recloning and high dilution 
conditions in order to select a hybrid cell that secretes a homogeneous population of antibodies 

5 specific to either the PN5 protein. 

In addition, antibodies can be raised by cloning and expressing nucleotide sequences or 
mutagenized versions thereof coding at least for the amino acid sequences required for specific 
binding of natural antibodies, and these expressed proteins used as the immunogen. 
Antibodies may include the complete immunoglobulin or a fragment thereof. Antibodies may 
10 be linked to a reporter group such as is described above with reference to polynucleotides. 
Example 10 illustrates practice of producing an antibody. 

3. Therapeutic Targets for Compounds to Treat Disorders and Assays Thereof. 

The present invention also includes the use of the novel voltage-gated, preferably TTX- 
resistant, sodium channel a-subunit as a therapeutic target for compounds to treat disorders of 
15 the nervous system based on the RT-PCR localization data. The disorders include, but are not 
limited to, epilepsy, stroke injury, brain injury, diabetic neuropathy, traumatic injury, chronic 
neuropathic pain, and AIDS-associated neuropathy. 

4. Designing Therapeutics based on Inhibiting PN5 and assays thereof. 

This invention is also directed to inhibiting the activity of PN5 in brain, spinal cord, 
20 DRG, nodose ganglia, and superior cervical ganglia tissues. However, it is to be understood 
that further studies may reveal that PN5 is present in other tissues, and as such, those tissues 
can also be targeted areas. For example, the detection of PN5 mRNA in nodose ganglia 
suggests that PN5 may conduct TTX-resistant sodium currents in this and other sensory 

ganglia of the nervous system. 
25 In addition, it has been found that proteins not normally expressed in certain tissues are 

expressed in a disease state. Therefore, this invention is intended to encompass the inhibition 
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of PN5 in tissues and cel. .ypes where the protein is normally expressed, and in those tissues 
and cell types where the protein .s only expressed during a disease state. 

For example, it is believed that TTX-resistant sodium channels play a key role in 
transmitting nerve impulses relating to sensory inputs such as pain and pressure. This 
information will facilitate the design of therapeutics that can be targeted to a specific area such 

as peripheral nerve tissue. 

The recombinant protein of the present invention can be used to screen for potential 
therapeutics that have the ability to inhibit the sodmm channel of interest. In particular, it 
would be useful to inhibit selectively the function of sodium channels in peripheral nerve 
tissues responsible for transmitting pain and pressure signals without simultaneously affecting 
the function of sodium channels in other tissues such as heart and muscle. Such selectivity 
would allow for the treatment of pain without causing side effects due to cardiac or 
neuromuscular completions. Therefore, it would be useful to have DNA sequences coding 
for sodium channels that are selectively expressed in peripheral nerve tissue. 

5 5. Pain Reliever. 

Sodium channels in peripheral nerve tissue play a large role in the transmisston of 
nerve impulses, and therefore are instrumental in understanding neuropathic pain transmission. 
Neuropathic pain falls into two components: allodynia, where a normally non-painful stimulus 
becomes painful, and hyperalgesia, where a usually normal painful stimulus becomes 

20 extremely painful. 

In tissue localization studies, PN5 mRNA maps small and medium neurons of DRG. 
PN5 mRNA is also present in brain and spinal cord. Inhibiting its activities may help prevent 
ailments such as headaches and migraines. The ability to inhibit the activity of these sodmm 
channels, i.e., reduce the conduction of nerve impulses, will affect the nerve's ability to 

25 transmit pain impulses. Selective inhibition of sodium channels in sensory neurons such as 
DRG will allow the blockage of pain impulses without complicating side effects caused by 
inhibition of sodium channels in other tissues such as brain and heart. In addition, certain 
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diseases are caused by sodium channels that produce ,mpulses at an extremely high frequency. 
The ability to reduce the activity of the channel can then eliminate or alleviate the disease. 
Accordingly, potential therapeutic compounds can be screened by methods well known in the 
art to discover whether they can inhibit the activity of the recombinant sodium channel of the 
5 invention. Barram, M. et al, Naun-Schmkdeberg's Archives of Pharmacology 347, 125-132 
(1993) and McNeal, E.T. et al, J. Med. Chem. 28, 381-388 (1985). For similar studies with 
the acetyl choline receptor, see, Claudio et al, Science 238, 1688-1694 (1987). 

For example, pain can be alleviated by inhibiting the activity of the novel preferably 
TTX-resistant sodium channel comprising administering a therapeutically effective amount of 
10 a compound having an IC 50 approximately 10 uM or less, preferably <1 uM. Potential 
therapeutic compounds are identified based on their ability to inhibit the activity of PN5. 
Therefore, the aforementioned assay can be used to identify compounds having a 
therapeutically effective IC 50 . 

The term „IC 50 " refers to the concentration of a compound that is required to inhibit by 
15 50% the activity of expressed PN5 when activity is measured by electrophysiology, flux 
assays, and toxin-binding assays, as mentioned above. 
6. Diagnostic Assays. 

The basic molecular biology techniques employed in accomplishing features of this 
invention, such as RNA, DNA and plasmid isolation, restriction enzyme digestion, preparation 
20 and probing of a cDNA library, sequencing clones, constructing expression vectors, 

transforming cells, maintaining and growing cell cultures, and other general techniques are 
well known in the art, and descriptions of such techniques can be found in general laboratory 
manuals such as Molecular Cloning: A Laboratory Manual by Sambrook et al. (Cold Spring 
Harbor Laboratory Press, 2nd edition, 1989). 

For example, the polynucleotides of the invention can be bound to a „reporter 
molecule" to form a polynucleotide probe useful for Northern and Southern blot analysis and 
in situ hybridizations. 

17 
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The term "reporter molecule" refers to a chemical entity capable of being detected by a 
suitable detection means, including, but not limited to, spectrophotometric, chemiluminescent, 
immunochemical, or radiochemical means. The polynucleotides of this invention can be 
conjugated to a reporter molecule by techniques well known in the art. Typically the reporter 

5 molecule contains a functional group suitable for attachment to or incorporation into the 
polynucleotide. The functional groups suitable for attaching the reporter group are usually 
activated esters or alkylating agents. Details of techniques for attaching reporter groups are 
well known in the art. See, for example, Matthews, J.A., Batki, A., Hynds, C, and Kricka, 
L.J., Anal. Biochem. 151, 205-209 (1985) and Engelhardt et al, European Patent Application 

10 No. 0302175. 



Accordingly, the following Examples are merely illustrative of the techniques by 
which the invention can be practiced. 

Abbreviations 

15 The following abbreviations are used throughout the Examples and have each of the 

respective meanings defined below. 
BSA: bovine serum albumin 

Denhardt's solution: 0.02% BSA, 0.02% polyvinyl-pyrrolidone, 0.02% Ficoll (0.1 g 
BSA, 0. 1 g Ficoll and 0. 1 g polyvinylpyrrolidone per 500 ml) 
20 DRG: dorsal root ganglia 

EDTA: Ethylenediaminetetraacetic acid, tetrasodium salt 

MEN: 20 mM MOPS, 1 mM EDTA, 5 mM sodium acetate, pH 7.0 

MOPS: 3-(N-morpholino)propanesulfonic acid (Sigma Chemical Company) 

PN5: peripheral nerve sodium channel 5 
25 PNS: peripheral nervous system 

SDS: sodium dodecyl sulfate 

SSC: 150 mM NaCl, 15 mM sodium citrate, pH 7.0 

18 



SSPE: 80 mM NaCl, 10 mM sodium phosphate, 1 mM ethylenediaminetetraacetate, pH 
8.0 

TEV: two electrode voltage clamp 

TTX: tetrodotoxin (Sigma Chemical Company) 
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F.XAMPLES 

The following Examples illustrate practice of the invention. 
Materials 

The plasmid pBK-CMV was obtained from Stratagene (La Jolla, CA); the plasmid 
5 pBSTA is described by Goldin et al, in Methods in Enzymology (Rudy & Iverson, eds.) 207, 
279-297; the plasmid pCIneo was obtained from Promega (Madison, WI); and the plasmid 
pCRII was obtained from Invitrogen (Carlsbad, CA). 

The oocyte expression vector plasmid pBSTAcIlr was constructed from 
pBSTA by insertion of a synthetic oligonucleotide linker; plasmid pKK232-8 was obtained 
10 from Pharmacia Biotech (Piscataway, NJ); plasmid pCPJI was obtained from Invitrogen, San 
Diego, CA. Competent E. coli cell lines STBL2™ and SURE® were obtained from 

Gibco/BRL and Stratagene, respectively. 

EXAMPLE 1 

ORTATNTNG FROM RAT rmfi BR ATN AND SPTNAL CORD 

15 

Lumbar DRG No. 4 and No. 5 (L4 and L5) brain and spinal cord were removed from 
anesthetized adult male Sprague-Dawley rats under a dissecting microscope. The tissues were 
frozen in dry ice and homogenized with a Polytron homogenizer; the RN A was extracted by 
the guanidine isothiocyanate procedure (see Chomczynksi et al. Anal. Biochemistry 162, 156- 
20 159 (1987)). Total RNA (5 |xg of each sample) was dissolved in MEN buffer containing 50% 
formamide, 6.6% formaldehyde and denatured at 65°C for 5-10 min. The RNA was 
electrophoresed through a 0.8% agarose gel containing 8.3% formaldehyde in MEN buffer. 
The electrode buffer was MEN buffer containing 3.7% formaldehyde; the gel was run at 50 V 
for 12-18 hours. 

25 Size markers, including ribosomal 18S and 28S RNAs and RNA markers (GIBCO 

BRL), were run in parallel lanes of the gel. Their positions were determined by staining the 
excised lane with ethidium bromide (0.5 ug/ml) followed by photography under UV light. 
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After electrophoresis, the gel was rinsed in 2xSSC and the RNA was transferred to a 
Duralose membrane (Stratagene) with 20xSSC by capillary action; the membrane was baked 
under vacuum at 80°C for 1 hour. 
5 EXAMPLE 2 

PROBE FROM RAT BRAIN IIA 

A 32 P-labeled cRNA probe complementary to nucleotides 4637-5868 of the rat brain 
HA sodium channel cc-subunit sequence was synthesized in vitro with T7 RNA polymerase 
10 (Pharmacia) using pEAF8 template DNA, (Noda et al, Nature 320, 188-192 (1986)) that had 

been linearized with BstEII. 

Protocols for each procedure mentioned above can be found in Molecular Cloning: A 
Laboratory Manual by Sambrook et al. (Cold Spring Harbor Laboratory Press, 2nd edition, 
1989). 

15 EXAMPLE 3 

HYBRIDIZATION OF RNA WITH THE PROBE FROM RAT B RAIN IIA 

The membrane of Example 1 was prehybridized in 50% formamide, 5xSSC, 50 mM 
20 sodium phosphate, pH 7.1, lx Denhardt's solution, 0.5% SDS, and sheared, heat-denatured 
salmon sperm DNA (1 mg/ml) for 16 hours at 42°C. The membrane was hybridized in 50% 
formamide, 5xSSC, 50 mM sodium phosphate, pH 7.1, lx Denhardt's solution, 0.5% SDS, and 
sheared, heat-denatured salmon sperm DNA (200 ^ig/ml) with the 32 P-labeled cRNA probe (ca. 
l-3xl0 6 cpm/ml) described in Example 2 for 18 hours at 42°C. 
25 The membrane was rinsed with 2xSSC, 0.1% SDS at room temperature for 20 min. and 

then washed sequentially with: 2xSSC, 0.1%- SDS at 55°C for 30 min., 0.2xSSC, 0.1% SDS 
at 65°C for 30 min., 0.2xSSC, 0.1% SDS at 70°C for 30 min., and 0.2xSSC, 0.1% SDS, 0.1% 
sodium pyrophosphate at 70°C for 20 min. The filter was exposed against Kodak X-omat AR 
film at -80°C with intensifying screens for up to 2 weeks. 
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The pEAFS probe hybridized to mRNAs in the DRG sample with sizes of 1 1 kb, 9.5 
kb, 7.3 kb, and 6.5 kb, estimated on the basis of their positions relative to the standards. 

FX AMPLE 4 

5 NOVF.T . SODIUM CHANNEL p nMATN TV PROBE 

The probe was obtained as follows: RT-PCR was performed on RNA isolated from rat 
DRG using degenerate oligonucleotide primers that were deigned based on the homologies 
between known sodium channels in domain IV. The domain IV products were cloned into a 
10 plasmid vector, transformed into E. coU and single colon.es isolated. The domain IV specific 
PGR products obtained from several of these colonies were individudly sequenced. Cloned 
novel domain IV sequence was as follows (SEQ ED NO: 4): 

! CTCAACATGG TTACGATGAT GGTGGAGACC GACGAGCAGG GCGAGGAGAA 
51 GACGAAGGTT CTGGGCAGAA TCAACCAGTT CTTTGTGGCC GTCTTCACGG 
15 101 GCGAGTGTGT GATGAAGATG TTCGCCCTGC GACAGTACTA TTTCACCAAC 
151 GGCTGGAACG TGTTCGACTT CATAGTGGTG ATCCTGTCCA TTGGGAGTCT 
201 GCTGTTTCT GCAATCCTTA AGTCACTGGA AAACTACTTC TCCCCGACGC 
251 TCTTCCGGGT CATCCGTCTG GCCAGGATCG GCCGCATCCT CAGGCTGATC 
301 CGAGCAGCCA AGGGGATTCG CACGCTGCTC TTCGCCCTCA TGATGTCCCT 
20 351 GCCCGCCCTC TTCAACATCG GCCTCCTCCT CTTCCTCGTC ATGTTCATCT 
401 ACTCCATCTT CGGCATGGCC AGCTTCGCTA ACGTCGTGGA CGAGGCCGGC 
451 ATCGACGACA TGTTCAACTT CAAGACCTTT GGCAACAGCA TGCTGTGCCT 
501 GTTCCAGATC ACCACCTCGG CCGGCTGGGA CGGCCTCCTC AGCCCCATCC 
551 TCAACACGGG GCCTCCCTAC TGCGACCCCA ACCTGCCCAA CAGCAACGGC 
25 601 TCCCGGGGGA ACTGCGGGAG CCCGGCGGTG GGCATCATCT TCTTCACCAC 
651 CTACATCATC ATCTCCTTCC TCATCGTGGT CAACATGTAT ATCGCAGTCA 
701 TC 



This sequence was labeled with 32 P by random priming. 



30 
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FY AMPLE 5 

iu]iutin ^nrrw^ — sodium cmmm^mLmm 

5 A Northern blot was prepared with lOug total RNA from rat brain, spinal cord, and 

DRG. The blot was hybridtzed with a cRNA probe from the 3'-UTR. The 3'-UTR was 
cloned into pSP 73 vector, the cRNA transcribed using a Trans Probe T lot (Pharmacia 
Biotech) and 32 P UTP. The blot was prehybridized for 2 hours at 65°C in a solution 
containing 5XSSC, IX Denhardt's solution, 0.5% SDS, 50mM sodium phosphate, pH 7.1. 

10 salmon sperm DNA (Img/ml) and 50% fonnamide. Hybridization was conducted at 45°C for 
18 hours in the above solution except that the salmon sperm DNA was included at a 
concentration of 200ug/ml and the 32 P -labeled probe was added at 7.5*1<* cpm.ml solution. 
The blot was subsequently washed three times at 2XSSC and 0.1% SDS at room temperature, 
once with 0.2XSSC and 0.1% SDS at 65°C for 20 min., and once with 0.2XSSC, 0.1% SDS 

15 and 0.1% sodium pyrophosphate at 65°C for 20 min. The blot was analyzed on a 

Phospholmager (BioRad) after an exposure of 2 days. The results indicated that there was a 
~6.5kb band signal present in brain only in the lane containing RNA from DRG. Because of 
the lower abundance of PN5 mRNA, as evidenced by the RT-PCR experiment, the 6.5kb band 
was not detectable in brain and spinal cord. 

20 EXAMPLE 6 

mNrgTRTTCTnON » srnFRNING O F cDNA LIBR ARY FROM RAT DRG 

25 An EcoRI-adapted cDNA library was prepared from normal adult male Sprague- 

Dawley rat DRG poly(A)f RNA using the Superscript Choice System (GffiCO BRL). cDNA 
(>4 kb) was selected by sucrose gradient fractionation as described by Kieffer, Gene 109, 115- 
1 19 (1991). The cDNA was then ligated into the Zap Express vector (Stratagene), and 
packaged with the Gigapack II XL lambda packaging extract (Stratagene). Similarly, a >2kb 

30 DRG cDNA library was synthesized. 
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Phage (3.5xl0 5 ) were screened by filter hybridization with a 32 P-labeled probe (rBOa, 
bases 4637-5868 as follows of Auld el al. Neuron 1, 449-461 (1988)). Filters were hybridized 
in 50% formamide, 5X SSPE, 5X Denhardfs solution, 0.5% SDS, 250ug/ml sheared, 
denatured salmon sperm DNA, and 50 mM sodium phosphate at 42°C and washed in 0.5X 

SSC/0.1% SDS at 50°C. 

Southern blots of EcoRI-digested plasmids were hybridized with the 32 P-labeled DNA 
probe, (SEQ ID NO: 4). The filters were then hybridized in 50% formamide, 6X SSC, 5X 
Denhardfs solution, 0.5%, SDS, and lOOug/ml sheared, denatured salmon sperm DNA at 
42°C and were washed in 0.1X SSC/0.1% SDS at 65°C. 

Positive clones were excised in vivo into pBK-CMV using the ExAssist/XLOLR 

system (Stratagene). 

EXAMPLE 7 

15 PT .ONES AN D TsTTTCT .EQTTDE ANALYSIS 

cDNA clones, 26.2 and 25.1 were isolated from the >4kb DRG cDNA library and 
clone 1.18 was isolated from the >2kb DRG cDNA library. By sequence analysis, 26.2 
appeared to be a full-length cDNA encoding a novel sodium channel and 25.1 extended from 
20 domain II to the S'-UTR. However, each had a deletion which truncated the coding region. 
Clone 1.18 had the 3'- untranslated region, in addition to the C-terminus of the deduced amino 
acid sequence of PN5. The construct in the expression vector, pBSTACIIr, consisted of 

sequences from 26.2 and 1.18. 

PN5 homology to other known sodium channels was obtained using the GAP/Best Fit 

25 (GCG) program: 

Channel % Similarity % Identity 

PN3a 71 54 

30 hPN3 l\ l\ 

PN4 71 *i 

PN4a 71 53 
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PN1 

rat brain type I 

rat brain type II 

rat brain type IE 

rat cardiac channel 

rat skeletal muscle channel 



72 55 

72 55 
71 54 
71 54 

73 56 
71 53 



Stahilizm p the PN5 full len gth cDNA 

10 A. Media, E. coli reX\ lines, and prnwth conditions: 

Growth of fragments of PN5 could be accomplished under standard conditions; 
however growth of plasmids containing full length constructs of PN5 (in pCIneo, pBSTAcIIr, 
and other vectors) could not be accomplished without use of special growth media, conditions, 
and E coli strains. The following proved to be optimal: (1) use of E. coli STBL2™ for 
15 primary transformation following ligation reactions and for large scale culturing; (2) solid 

media was l/2x FM (see below) plus Ix LB (Tryptone, 1%, Yeast Extract, 0.5%, NaCl, 0.5%), 
plus 15g/L agar, or lxFM plus l/2x LB; (3) liquid media optimally was lx FM plus l/2x LB; 
(4) carbenicillin, lOOug/ml, was used for all media, as it is metabolized less rapidly than 
ampicillin; (5) temperature for growth should be no greater than 30°C, usually 24-26°C; this 
20 necessitated longer growth periods than normally employed, from 24 to 72 hours. 
2x Freezing Medium (2xFM) : 



25 



K2HP04 


12.6g 


Na3Citrate 


0.9g 


MgSO4.7H20 


0.18g 


(NH4)2S04 


1.8g 


KH2P04 


3.6g 


Glycerol 


88g 


H20 


qs to IL 



2x FM and the remaining media components are prepared separately, sterilized by autoclaving, 
30 cooled to at least 60°C, and added together to form the final medium. Carbenicillin is prepared 
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at 25mg/ml H20 and sterilized by filtration. 2x FM was first described for preparation of 
frozen stocks of bacterial cells (Practical Methods in Molecular Biology, Schleif, R.F. and 
Wensink, P.C., Springer-Verlag, New York (1981) pp. 201-202). 



5 B. Fv pression Vectors 

In order to provide for increased stability of the full length cDNA, the oocyte 
expression vector pBSTAcIIr was modified to reduce plasmid copy number when grown in £. 
coli and to reduce possible read-through transcription from vector sequences that might result 
in toxic cryptic expression of PN5 protein, Brosius J., Gene 27, 151-160(1984). pBSTAcIIr 

10 was digested with PvuH The 755 bp fragment containing the T7 promoter, B-globin 5'UTR, 
the multiple cloning site, B-globin 3'UTR, and T3 promoter was ligated to the 3.6 kb fragment 
containing the replication origin, ampicillin resistance gene, rmBT, and rrnBT,^ 
transcription terminators from P KK232-8, which had been fully digested with Smal and 
partially digested with PvuH and treated with shrimp intestinal phosphatase to prevent self 

15 ligation. The resulting plasmid in which the orientation of the pBSTA fragment is such that 
the T7 promoter is proximal to the rrnBT, terminator was identified by restriction mapping 
and named pHQ8. As is the case with pBSTA, the direction of transcription of the ampicillin 
resistance gene and replication origin of P HQ8 is opposite to that of the gene expression 
cassette, and the presence of the rrnB T, terminator should reduce any remaining read-through 

20 from the vector into the T7 promoter driven expression cassette. 
C. Assembly of f" H l«n pth cDN A for expression 

Since pBK-CMV.26.2 had a 58 bp deletion (corresponding to bp 4346 to 4403 of SEQ 
ID NO: 1) and the sequnce of pBK-CMV.1.18 begins at bp 4180 of SEQ ID NO: 1, pBK- 
CMV.1.18 could be used to „repair" pBK-CMV.26.2. A strategy was developed to assemble a 
25 full length cDNA from clones pBK-CMV.26.2 and P BK-CMV.1.18 in three sections, 

truncating the 5' and 3' UTRs and introducing unique restriction sites at the 5' and 3' ends in 
the process. The 5' end 
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was generated by PCR from 26.2, truncating the 5' UTR by incorporating a Sail site just 
upstream of the start codon. The central section was a restriction fragment from 26.2. The 3' 
end was prepared by overlap PCR from both 26.2 and 1.18 and incorporating an Xbal site just 
down stream of the stop codon. These sections were digested at unique restriction sites and 
5 assembled in pBSTAcIIr. Although this construct appeared to have a correct sequence, upon 
recloning as a Sail to Xbal fragment into pCIneo, two type of isolates were found, one with a 
deletion and one with an 8 bp insertion. Reexamination of the pBSTAcIIr clone showed the 
sequence was „mixed" in this region, so that the clone must have rearranged. The 8 bp 
insertion was found as a repeat of one of the members of an 8 bp duplication in the native 
10 sequence, forming a triple 8 bp repeat in the rearranged isolate. Numerous cloning attempts 
inevitably gave rise to this rearrangement. Overlap PCR was used to introduce silent 
mutations into one of the 8 bp repeats, and a fragment containing this region was included 
when the PN5 coding region was assembled into HQS, the low-copy number version of 
pBSTAcIIr, to give plasmid HR-1. This sequence proved to be stable (see Figures 5A-E, SEQ 
15 ID NO: 5). 

The 5' end fragment was prepared by PCR using pBK-CMV.26.2 DNA as template 
and primers 4999 (pTTGGTCGACTCTAGATCAGGGTGAAGATGGAGGAG; Sail site 
underlined, PN5 homology in italics, corresponding to bp 58-77 of SEQ ID NO: 1, initiation 
codon in bold) and 4927 (GGGTTCAATGTGGTTTTATCT, corresponding to bp 1067 to 
20 1047 of SEQ ID NO: 1), followed by gel purification, digestion with Sail and Kpnl (Kpnl site 
at pb 1003-1008, SEQ ID NO: 1), and gel purification. 

The central 3.1 kb fragment was prepared by digestion of pBK-CMV.26.2 DNA with 
Kpnl and AatH (Aatfl site at 4133-4138), followed by gel purification. 

The 3' end fragment was prepared as follows: PCR using primers 4837 
25 (TCTGGGAAGTTTGGAAG, corresponding to bp 3613 to 3629 of SEQ ID NO: 1) and 4931 
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(GACCACGAAGGCTATGTTGAGG, correspond to bp 4239 to 42,8 of SEQ ID NO: 0 
on pBK-CMV.26.2 DNA as tempiate 6 ave a fragment of 0.6 kb. PGR using primers 4930 
(CCTCAACATAGCCrrCGTGGTC, corresponding ,0 bp 4218 ,0 4239 of SEQ ID NO: 1) 
and 4929 { / 3 TCncmATGAGGGTTCAGTCATTGTG, Xbal site underlined. PN5 
5 no.no.ogy in itahcs, conesponding .0 pb 5386 to 5365 of SEQ ID NO: I. stop codon in bold, 
on pBK-CMV.1.18 DNA as template gave afragmen, of 1.2 kb, introducing a Xbal sue 7 bp 
from the stop codon. Thus .he 3' end of the 4837-493. fragment exactly complements the 5 
end of the 4930-4929 fragment. These two fragments were ge, purified and a fraction of each 
combined as ,emp.ate in a PCR reaction using primers 4928 (CAAGCCTTTGTGTTCGAC, 
10 corresponding to bp 4084 to 410. of SEQ ID NO: 1) and 4929. to give a fragment of 1.3 kb. 
Th,s fragment was ge, purifed. diges.ee with Aafl, and Xbal, and the 1.2 Kb fragment ge, 
purified. 

The V end fragment was cloned into Aafll and Xbal digested pBSTAcHr. One is.oate 
was digested with SaH and Kpnl and ugated to the S - end fragment. The resuiting p,asmid, 
15 after sequence venftcation, was digested with Kpnl and Aatn and ,iga.ed to u,e cental 3.1 kb 
fragment, to form p BSTAcIIr.PN5(c.one 21). pBSTAcBr.PNS (clone 2!) was dtgested w,.h 
Sail and Xbal to release the 5.3 kb PN5 fragment which was cloned into Sail and Xbal 
digested pCIneon. Multiple isolates were found, of which GPII-1, which was completely 
seo.uence^, was typica, and contained an 8 bp insert. This CAGAAGAA, ate pb 3994 of 
20 SEQ ID NO: 1, converted the direct repeat of this sequence at thrs location into a tnple drree, 
repeat, causing a shift in the readrng frame. In an attempt to repair this defect, pBSTAcDr. 
PN5 (clone 2,) was digested wth Nhel (bp 2538-2543 SEQ ID NO: 1) and Xhol (bp 4828- 
4833, SEQ ED NO: 1) to give a 6.2 kb fragment and with Aafll and Xhol to give a 0.7 kb 
fragment which were ligated to the 1.6 kp fragment resulting from digestion of pBK- 
25 CMV.26.2 with Aafll and Nhel. Although no isoiates were found which were completely 
correct, one isolate, HA-4, had only a single base 



28 



change, deletion of the C at bp 4827 (SEQ ID NO: 1) adjacent to the Xhol site. 

In order to prevent the 8 bp insertion rearrangement from occurring, three silent 
stations were introduced in the 5 • repeat, and two additional mutations in a string ot Ts 

..I •jqro tr> 4014 SEO ED NO: 1; mutation sites 

would also be introduced, as shown below (bp 3982 to 4014, iv 

5 underlined, 8 bp repeats in native sequence in italics): 

GAC ATT TTT ATG ACA GAA GAA CAG AAG AAA TAT 
Asp He Phe Met Thr Glu Glu Gin Lys Lys Tyr 
GAC ATC TTC ATG ACT GAG GAG CAG AAG AAA TAT 

l0 As isolate HA-4 had the native direct repeat sequence (as opposed to e.g. 

pB STAcIIr.PN5 (clone 21)) and the region near the Xhol site defect would not be involved, it 
was used as template DNA for the following PCR reactions. Primer P5-3716S 
(CCGAAGCCAATGTAACATTAGTAATTACTCGTG, corresponding to pb 3684 to 3716, 

SEQ ED NO: 1) was paired with primer P5-3969AS 
15 (GCTCCTC AGTC ATGA AQATGTCTTGGCC ACCT AAC , correspond to bp 4003 to 3969, 
SEQ ID NO: 1, mutated bases arc underlined ) to give a 320 bp product. Primer P5-4017S 
(GGCCAAGACATCrrCATGACTGASGAQCAGAAGAAATATTAC, corresponding to bp 

3976 to 4017, SEQ ID NO: 1; mutated bases are underlined) was paired with primer P5- 
4247 AS (CTCAAAOCAAAGACTITGATG AGACACTCTATGG, corresoinding ,0 bp 4280 
20 to 4247, SEQ ID NO: 1) to give a 305 bp product. The J- er,d of the 320 bp fragment thus has 
a 28 bp exact match to the 5' end of the 305 bp fragment. The two bands were gel purified 
and a fraction of each combined in a new PCR reaction with primers P5-37 16S and P5- 
4247 AS to give a 597 bp product, which was T/A cloned tato vector pCWI. Isolate HO-7 was 
found to have the desired sequence. A four-way ligation was performed to assemble the full- 
25 length, modified PN5: 
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the oocyte expression vector HQ-8 ws digested with Sail and Xbal to give a 4.4 kb vector 
fragment; GPII-1 was digested wtih Sail and Mlul to give a 3.8 kb fragment containing the 5' 
half of PN5; HO-7 was digested with Mlul (bp 3866 to 3871, SEQ ID NO: 1) and Aatll to give 
a 0.3 kb fragment containing the mutant 8 bp repeat region of PN5; GPH-1 was Rested with 
5 Aatn and Xbal to give the remaining 1.3 kb 3' portion of PN5. A portion of the ligation 
reaction was transformed into E. coli Stable 2 cells. Of the 9.6 kb isolates containing all four 
fragments, HR-1 was sequenced and found to have the desired 5.4 kb sequence. These isolates 
grew well and showed no tendency to rearrange. The sequence of this engineered version of 
PN5 is shown in Figures 5A-E (SEQ ID NO: 5). 

EXAMPLE 8 
HUMAN PN5 

15 An 856 bp clone (Figure 3A, SEQ ID No.: 3) has been isolated from a human dorsal 

root ganglia (DRG) cDNA library that is most closely related to rat PN5 with 79% identity for 
the amino acid sequence. The human PN5 sequence spans the region between IES1 and 
interdomain HI/TV which includes the fast inactivation gate (i.e., IFM) that is located within 
interdomain in/TV. 

20 The human DRG cDNA library was constructed from lumbar 4 and 5 DRG total RNA 

that was randomly primed. First strand cDNA was synthesized with Superscript II reverse 
transcriptase (GIB CO BRL) and the second strand synthesis with T4 DNA polymerase. EcoRI 
adaptors were ligated to the ends of the double stranded cDNAs and the fragments cloned into 
the ZAP H vector (Stratagene). The library was screened with digoxigenin-labeled rat PN3, 

25 rat PN1 and human heart hHl probes. Positive clones were sequenced and compared to 
known human and rat sodium channel sequences. Only the aforementioned clone was 
identified as human PN5 sequence. 

Channel % Similarity % Identity 

30 

Human Brain (HBA) 76 69 

Human Heart (hHl) 81 74 

30 
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Human Atypical Heart 60 52 

Human Skeletal Muscle 80 / j 

Human Neuroendocrine 78 

Human PN3 11 !? 

RatPNl l\ ?1 

RatPN3 78 71 

RatPN4 II 'I 

RatPN5 86 /y 

Figure 3B compares the amino acid sequence of the hPN5 fragment with the rat PN5 
amino acid sequence in the appropriate region. 

F.X AMPLE 9 
TTSSTTF DISTRIBUTION BY RT-PCR 



Brain, spinal cord, DRG, nodose ganglia, superior cervical ganglia, sciatic nerve, heart 
and skeletal muscle tissue were isolated from anesthetized, normal adult male Sprague-Dawley 
rats and were stored at -80°C. RNA was isolated from each tissue using RNAzol (Tel-Test, 
20 Inc.). Random-primedcDNAwasreversetranscribedfromSOOngofRNAfromeachtissue. 
The forward primer (CAGATTGTGTTCTCAGTACATTCC) and the reverse primer 
(CCAGGTGTCTAACGAATAAATAGG) were designed from the 3 '-untranslated region to 
yield a 252 base pair fragment. The cycle parameters were: 94°C/2 min. (denaturation), 
94°C/30 sec, 65°C/30 sec. and 72°C/lmin. (35 cycles) and 72°C/4 min. The reaction 
25 products were analyzed on a 4% agarose gel. 

A positive control and a no-template control were also included. cDNA from each 
tissue was also PCR amplified using primers specific for glyceraldehyde-3-phosphate 
dehydrogenase to demonstrate template viability, as described by Tso et al, Nucleic Acid Res. 
13, 2485-2502 (1985). 

30 Tissue distribution profile of rPN5 by analysis of RNA from selected rat tissues by RT- 

PCR was as follows: 

Tissue RiP^I3J^ycJes) 



Brain + 
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Spinal cord 
DRG 

Nodose ganglia 
Superior cervical ganglia + 
Sciatic nerve 
Heart 

Skeletal muscle 
Fll-untreated + 
Fll -treated + 
PN5 was also detected after only 25 cycles (24 + 1) in the same five tissues as above in 

the same relative abundance. 

FX AMPLE 10 
ANTIBODIES 

A synthetic peptide (26 amino acids in interdomain H and m - residues 977 to 1002) 
was conjugated to KLH and antibody raised in rabbits. The antiserum was subsequently 
affinity purified. 

PN5 constitutes a subfamily of novel sodium channel genes; these genes are different from 
those detectable with other probes (e.g., PEAF8 and PN3 probes). 

Although the foregoing invention has been described in some detail by way of 
illustration and example for purposes of clarity and understanding, it will be obvious that 
certain changes and modifications may be practiced within the scope of the appended claims. 
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SEQUENCE LISTING 



(1) 



GENERAL INFORMATION: 



(i) APPLICANT: 



(A) 
(B) 
(C) 
(D) 
(E) 
(F) 
(G) 
(H) 
(I) 



NAME: F. HOFFMANN- LA ROCHE AG 
STREET: Grenzacherstrasse 124 
CITY: Basle 
STATE: BS 

COUNTRY : Switzerland 
POSTAL CODE (ZIP) : CH-4010 
TELEPHONE: 061-6884256 
TELEFAX: 061-6881395 
TELEX: 962292/965542 hlr ch 



(ii) TITLE OF INVENTION: Nucleic Acid Encoding a Nervous Tissue 
Sodium Channel 

(iii) NUMBER OF SEQUENCES: 5 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: Patent In Release ( 1.0, Version ( 1.3 0 

(v) CURRENT APPLICATION DATA 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5908 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: rat 

(F) TISSUE TYPE: Dorsal root ganglia 

(G) CELL TYPE: Peripheral nerve 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



GAAGTCACAG GAGTGTCTGT CAGCGAGAGG AAGAAGGGAG AGTTTACTGA GTGTCTTCTG 60 

CCCCTCCTCA GGGTGAAGAT GGAGGAGAGG TACTACCCGG TGATCTTCCC GGACGAGCGG 120 

AATTTCCGCC CCTTCACTTC CGACTCTCTG GCTGCCATAG AGAAGCGGAT TGCTATCCAA 180 

AAGGAGAGGA AGAAGTCCAA AGACAAGGCG GCAGCTGAGC CCCAGCCTCG GCCTCAGCTT 240 

GACCTAAAGG CCTCCAGGAA GTTACCTAAG CTTTATGGTG ACATTCCCCC TGAGCTTGTA 300 

GCGAAGCCTC TGGAAGACCT GGACCCATTC TACAAAGACC ATAAGACATT CATGGTGTTG 3 60 
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AACAAGAAGA GAACAATTTA TCGCTTCAGC GCCAAGCGGG CCTTGTTCAT TCTGGGGCCT 420 

TTTAATCCCC TCAGAAGCTT AATGATTCGT ATCTCTGTCC ATTCAGTCTT TAGCATGTTC 480 

ATCATCTGCA CGGTGATCAT CAACTGTATG TTCATGGCGA ATTCTATGGA GAGAAGTTTC 54 0 

GACAACGACA TTCCCGAATA CGTCTTCATT GGGATTTATA TTTTAGAAGC TGTGATTAAA 6 00 

ATATTGGCAA GAGGCTTCAT TGTGGATGAG TTTTCCTTCC TCCGAGATCC GTGGAACTGG 660 

CTGGACTTCA TTGTCATTGG AACAGCGATC GCAACTTGTT TTCCGGGCAG CCAAGTCAAT 720 

CTTTCAGCTC TTCGTACCTT CCGAGTGTTC AGAGCTCTGA AGGCGATTTC AGTTATCTCA 780 

GGTCTGAAGG TCATCGTAGG TGCCCTGCTG CGCTCGGTGA AGAAGCTGGT AGACGTGATG 84 0 

GTCCTCACTC TCTTCTGCCT CAGCATCTTT GCCCTGGTCG GTCAGCAGCT GTTCATGGGA 900 

ATTCTGAACC AGAAGTGTAT TAAGCACAAC TGTGGCCCCA ACCCTGCATC CAACAAGGAT 96 0 

TGCTTTGAAA AGGAAAAAGA TAGCGAAGAC TTCATAATGT GTGGTACCTG GCTCGGCAGC 1020 

AGACCCTGTC CCAATGGTTC TACGTGCGAT AAAACCACAT TGAACCCAGA CAATAATTAT 1080 

ACAAAGTTTG ACAACTTTGG CTGGTCCTTT CTCGCCATGT TCCGGGTTAT GACTCAAGAC 1140 

TCCTGGGAGA GGCTTTACCG ACAGATCCTG CGGACCTCTG GGATCTACTT TGTCTTCTTC 12 00 

TTCGTGGTGG TCATCTTCCT GGGCTCCTTC TACCTGCTTA ACCTAACCCT GGCTGTTGTC 1260 

ACCATGGCTT ATGAAGAACA GAACAGAAAT GTAGCTGCTG AGACAGAGGC CAAGGAGAAA 1320 

ATGTTTCAGG AAGCCCAGCA GCTGTTAAGG GAGGAGAAGG AGGCTCTGGT TGCCATGGGA 1380 

ATTGACAGAA GTTCCCTTAA TTCCCTTCAA GCTTCATCCT TTTCCCCGAA GAAGAGGAAG 144 0 

TTTTTCGGTA GTAAGACAAG AAAGTCCTTC TTTATGAGAG GGTCCAAGAC GGCCCAAGCC 1500 

TCAGCGTCTG ATTCAGAGGA CGATGCCTCT AAAAATCCAC AGCTCCTTGA GCAGACCAAA 156 0 

CGACTGTCCC AGAACTTGCC AGTGGATCTC TTTGATGAGC ACGTGGACCC CCTCCACAGG 1620 

CAGAGAGCGC TGAGCGCTGT CAGTATCTTA ACCATCACCA TGCAGGAACA AGAAAAATTC 1680 

CAGGAGCCTT GTTTCCCATG TGGGAAAAAT TTGGCCTCTA AGTACCTGGT GTGGGACTGT 1740 

AGCCCTCAGT GGCTGTGCAT AAAGAAGGTC CTGCGGACCA TCATGACGGA TCCCTTTACT 18 00 

GAGCTGGCCA TCACCATCTG CATCATCATC AATACCGTTT TCTTAGCCGT GGAGCACCAC 1860 

AACATGGATG ACAACTTAAA GACCATACTG AAAATAGGAA ACTGGGTTTT CACGGGAATT 1920 

TTCATAGCGG AAATGTGTCT CAAGATCATC GCGCTCGACC CTTACCACTA CTTCCGGCAC 1980 

GGCTGGAATG TTTTTGACAG CATCGTGGCC CTCCTGAGTC TCGCTGATGT GCTCTACAAC 2 04 0 
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ACACTGTCTG ATAACAATAG GTCTTTCTTG GCTTCCCTCA GAGTGCTGAG GGTCTTCAAG 2100 

TTAGCCAAAT CCTGGCCCAC GTTAAACACT CTCATTAAGA TCATCGGCCA CTCCGTGGGC 2160 

GCGCTTGGAA ACCTGACTGT GGTCCTGACT ATCGTGGTCT TCATCTTTTC TGTGGTGGGC 222 0 

ATGCGGCTCT TCGGCACCAA GTTTAACAAG ACCGCCTACG CCACCCAGGA GCGGCCCAGG 2280 

CGGCGCTGGC ACATGGATAA TTTCTACCAC TCCTTCCTGG TGGTGTTCCG CATCCTCTGT 2340 

GGGGAATGGA TCGAGAACAT GTGGGGCTGC ATGCAGGATA TGGACGGCTC CCCGTTGTGC 24 00 

ATCATTGTCT TTGTCCTGAT AATGGTGATC GGGAAGCTTG TGGTGCTTAA CCTCTTCATT 24 6 0 

GCCTTGCTGC TCAATTCCTT CAGCAATGAG GAGAAGGATG GGAGCCTGGA AGGAGAGACC 2 520 

AGGAAAACCA AAGTGCAGCT AGCCCTGGAT CGGTTCCGCC GGGCCTTCTC CTTCATGCTG 2580 

CACGCTCTTC AGAGTTTTTG TTGCAAGAAA TGCAGGAGGA AAAACTCGCC AAAGCCAAAA 2640 

GAGACAACAG AAAGCTTTGC TGGTGAGAAT AAAGACTCAA TCCTCCCGGA TGCGAGGCCC 2700 

TGGAAGGAGT ATGATACAGA CATGGCTTTG TACACTGGAC AGGCCGGGGC TCCGCTGGCC 2 76 0 

CCACTCGCAG AGGTAGAGGA CGATGTGGAA TATTGTGGTG AAGGCGGTGC CCTACCCACC 2820 

TCACAACATA GTGCTGGAGT TCAGGCCGGT GACCTCCCTC CAGAGACCAA GCAGCTCACT 288 0 

AGCCCGGATG ACCAAGGGGT TGAAATGGAA GTATTTTCTG AAGAAGATCT GCATTTAAGC 2 94 0 

ATACAGAGTC CTCGAAAGAA GTCTGACGCA GTGAGCATGC TCTCGGAATG CAGCACAATT 3 000 

GACCTGAATG ATATCTTTAG AAATTTACAG AAAACAGTTT CCCCCAAAAA GCAGCCAGAT 306 0 

AGATGCTTTC CCAAGGGCCT TAGTTGTCAC TTTCTATGCC ACAAAACAGA CAAGAGAAAG 3120 

TCCCCCTGGG TCCTGTGGTG GAACATTCGG AAAACCTGCT ACCAAATCGT GAAGCACAGC 3180 

TGGTTTGAGA GTTTCATAAT CTTTGTTATT CTGCTGAGCA GTGGAGCGCT GATATTTGAA 3240 

GATGTCAATC TCCCCAGCCG GCCCCAAGTT GAGAAATTAC TAAGGTGTAC CGATAATATT 3300 

TTCACATTTA TTTTCCTCCT GGAAATGATC CTGAAGTGGG TGGCCTTTGG ATTCCGGAGG 33 60 

TATTTCACCA GTGCCTGGTG CTGGCTTGAT TTCCTCATTG TGGTGGTGTC TGTGCTCAGT 3420 
CTCATGAATC TACCAAGCTT GAAGTCCTTC CGGACTCTGC GGGCCCTGAG ACCTCTGCGG 3480 
GCGCtGTCCC AGTTTGAAGG AATGAAGGTT GTCGTCTACG CCCTGATCAG CGCCATACCT 3 54 0 
GCCATTCTCA ATGTCTTGCT GGTCTGCCTC ATTTTCTGGC TCGTATTTTG TATCTTGGGA 3600 
GTAAATTTAT TTTCTGGGAA GTTTGGAAGG TGCATTAACG GGACAGACAT AAATATGTAT 366 0 
TTGGATTTTA CCGAAGTTCC GAACCGAAGC CAATGTAACA TTAGTAATTA CTCGTGGAAG 3 720 
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GTCCCGCAGG TCAACTTTGA CAACGTGGGG AATGCCTATC TCGCCCTGCT GCAAGTGGCA 3780 

ACCTATAAGG GCTGGCTGGA AATCATGAAT GCTGCTGTCG ATTCCAGAGA GAAAGACGAG 3840 

CAGCCGGACT TTGAGGCGAA CCTCTACGCG TATCTCTACT TTGTGGTTTT TATCATCTTC 3 900 

GGCTCCTTCT TTACCCTGAA CCTCTTTATC GGTGTTATTA TTGACAACTT CAATCAGCAG 3960 

CAGAAAAAGT TAGGTGGCCA AGACATTTTT ATGACAGAAG AACAGAAGAA ATATTACAAT 4020 

GCAATGAAAA AGTTAGGAAC CAAGAAACCT CAAAAGCCCA TCCCAAGGCC CCTGAACAAA 4080 

TGTCAAGCCT TTGTGTTCGA CCTGGTCACA AGCCAGGTCT TTGACGTCAT CATTCTGGGT 4140 

CTTATTGTCT TAAATATGAT TATCATGATG GCTGAATCTG CCGACCAGCC CAAAGATGTG 4200 

AAGAAAACCT TTGATATCCT CAACATAGCC TTCGTGGTCA TCTTTACCAT AGAGTGTCTC 4260 

ATCAAAGTCT TTGCTTTGAG GCAACACTAC TTCACCAATG GCTGGAACTT ATTTGATTGT 432 0 

GTGGTCGTGG TTCTTTCTAT CATTAGTACC CTGGTTTCCC GCTTGGAGGA CAGTGACATT 4 3 80 

TCTTTCCCGC CCACGCTCTT CAGAGTCGTC CGCTTGGCTC GGATTGGTCG AATCCTCAGG 444 0 

CTGGTCCGGG CTGCCCGGGG AATCAGGACC CTCCTCTTTG CTTTGATGAT GTCTCTCCCC 4 500 

TCTCTCTTCA ACATCGGTCT GCTGCTCTTC CTGGTGATGT TCATTTACGC CATCTTTGGG 4560 

ATGAGCTGGT TTTCCAAAGT GAAGAAGGGC TCCGGGATCG ACGACATCTT CAACTTCGAG 4620 

ACCTTTACGG GCAGCATGCT GTGCCTCTTC CAGATAACCA CTTCGGCTGG CTGGGATACC 4680 

CTCCTCAACC CCATGCTGGA GGCAAAAGAA CACTGCAACT CCTCCTCCCA AGACAGCTGT 474 0 

CAGCAGCCGC AGATAGCCGT CGTCTACTTC GTCAGTTACA TCATCATCTC CTTCCTCATC 4800 

GTGGTCAACA TGTACATCGC TGTGATCCTC GAGAACTTCA ACACAGCCAC GGAGGAGAGC 4860 

GAGGACCCTC TGGGAGAGGA CGACTTTGAA ATCTTCTATG AGGTCTGGGA GAAGTTTGAC 4 920 

CCCGAGGCGT CGCAGTTCAT CCAGTATTCG GCCCTCTCTG ACTTTGCGGA CGCCCTGCCG 4 980 

GAGCCGTTGC GTGTGGCCAA GCCGAATAAG TTTCAGTTTC TAGTGATGGA CTTGCCCATG 5040 

GTGATGGGCG ACCGCCTCCA TTGCATGGAT GTTCTCTTTG CTTTCACTAC CAGGGTCCTC 5100 

GGGGACTCCA GCGGCTTGGA TACCATGAAA ACCATGATGG AGGAGAAGTT TATGGAGGCC 5160 

AACCCTTTTA AGAAGCTCTA CGAGCCCATA GTCACCACCA CCAAGAGGAA GGAGGAGGAG 5220 
CAAGGCGCCG CCGTCATCCA GAGGGCCTAC CGGAAACACA TGGAGAAGAT GGTCAAACTG 5280 
AGGCTGAAGG ACAGGTCAAG TTCATCGCAC CAGGTGTTTT GCAATGGAGA CTTGTCCAGC 5340 
TTGGATGTGG CCAAGGTCAA GGTTCACAAT GACTGAACCC TCATCTCCAC CCCTACCTCA 54 00 
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CTGCCTCACA GCTTAGCCTC CAGCCTCTGG CGAGCAGGCG GCAGACTCAC TGAACACAGG 5460 

CCGTTCGATC TGTGTTTTTG GCTGAACGAG GTGACAGGTT GGCGTCCATT TTTAAATGAC 5520 

TCTTGGAAAG ATTTCATGTA GAGAGATGTT AGAAGGGACT GCAAAGGACA CCGACCATAA 5580 

CGGAAGGCCT GGAGGACAGT CCAACTTACA TAAAGATGAG AAACAAGAAG GAAAGATCCC 5640 

AGGAAAACTT CAGATTGTGT TCTCAGTACA TCCCCCAATG TGTCTGTTCG GTGTTTTGAG 5700 

TATGTGACCT GCCACATGTA GCTCTTTTTT GCATGTACGT CAAAACCCTG CAGTAAGTTG 5760 

ATAGCTTGCT ACGGGTGTTC CTACCAGCAT CACAGAATTG GGTGTATGAC TCAAACCTAA 5820 

AAGCATGACT CTGACTTGTC AGTCAGCACC CCGACTTTCA GACGCTCCAA TCTCTGTCCC 5880 

5 908 

AGGTGTCTAA CGAATAAATA GGTAAAAG 

(3) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 176 5 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNE S S : 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: rat 

(F) TISSUE TYPE: dorsal root ganglia 

(G) CELL TYPE: peripheral nerve 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Glu Glu Arg Tyr Tyr Pro Val lie Phe Pro Asp Glu Arg Asn Phe 

15 10 15 

Arq Pro Phe Thr Ser Asp Ser Leu Ala Ala He Glu Lys Arg lie Ala 

20 25 30 

He Gin Lys Glu Arg Lys Lys Ser Lys Asp Lys Ala Ala Ala Glu Pro 

35 40 45 

Gin Pro Arg Pro Gin Leu Asp Leu Lys Ala Ser Arg Lys Leu Pro Lys 

50 55 60 a 

Leu Tyr Gly Asp He Pro Pro Glu Leu Val Ala Lys Pro Leu Glu Asp 
65 70 75 80 

Leu Asp Pro Phe Tyr Lys Asp His Lys Thr Phe Met Val Leu Asn Lys 

85 90 95 

Lys Arg Thr He Tyr Arg Phe Ser Ala Lys Arg Ala Leu Phe He Leu 

100 105 HO 

Gly Pro Phe Asn Pro Leu Arg Ser Leu Met He Arg He Ser Val His 

115 120 I 25 

Ser Val Phe Ser Met Phe He He Cys Thr Val He He Asn Cys Met 

130 135 1*0 

Phe Met Ala Asn Ser Met Glu Arg Ser Phe Asp Asn Asp He Pro Glu 
1 45 150 155 16° 

Tvr Val Phe He Gly He Tyr He Leu Glu Ala Val He Lys He Leu 
y 165 170 175 
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Ala Arg Gly Phe He Val Asp Glu Phe Ser Phe Leu Arg Asp Pro Trp 

180 185 190 

Asn Trp Leu Asp Phe lie Val He Gly Thr Ala He Ala Thr Cys Phe 

195 200 205 

Pro Gly Ser Gin Val Asn Leu Ser Ala Leu Arg Thr Phe Arg Val Phe 

210 215 220 

Arq Ala Leu Lys Ala lie Ser Val He Ser Gly Leu Lys Val He Val 
225 230 235 240 

Gly Ala Leu Leu Arg Ser Val Lys Lys Leu Val Asp Val Met Val Leu 

245 250 255 

Thr Leu Phe Cys Leu Ser He Phe Ala Leu Val Gly Gin Gin Leu Phe 

260 265 270 

Met Gly He Leu Asn Gin Lys Cys He Lys His Asn Cys Gly Pro Asn 

275 280 285 

Pro Ala Ser Asn Lys Asp Cys Phe Glu Lys Glu Lys Asp Ser Glu Asp 

290 295 300 

Phe He Met Cys Gly Thr Trp Leu Gly Ser Arg Pro Cys Pro Asn Gly 
305 310 315 320 

Ser Thr Cys Asp Lys Thr Thr Leu Asn Pro Asp Asn Asn Tyr Thr Lys 

325 330 335 

Phe Asp Asn Phe Gly Trp Ser Phe Leu Ala Met Phe Arg Val Met Thr 

340 345 350 

Gin Asp Ser Trp Glu Arg Leu Tyr Arg Gin He Leu Arg Thr Ser Gly 

355 360 365 

He Tyr Phe Val Phe Phe Phe Val Val Val He Phe Leu Gly Ser Phe 

370 375 380 

Tyr Leu Leu Asn Leu Thr Leu Ala Val Val Thr Met Ala Tyr Glu Glu 
385 390 395 400 

Gin Asn Arg Asn Val Ala Ala Glu Thr Glu Ala Lys Glu Lys Met Phe 

405 410 415 

Gin Glu Ala Gin Gin Leu Leu Arg Glu Glu Lys Glu Ala Leu Val Ala 

420 425 430 

Met Gly He Asp Arg Ser Ser Leu Asn Ser Leu Gin Ala Ser Ser Phe 

435 440 445 

Ser Pro Lys Lys Arg Lys Phe Phe Gly Ser Lys Thr Arg Lys Ser Phe 

450 455 460 

Phe Met Arg Gly Ser Lys Thr Ala Gin Ala Ser Ala Ser Asp Ser Glu 
465 470 475 480 

Asp Asp Ala Ser Lys Asn Pro Gin Leu Leu Glu Gin Thr Lys Arg Leu 

485 490 495 

Ser Gin Asn Leu Pro Val Asp Leu Phe Asp Glu His Val Asp Pro Leu 

500 505 510 

His Arg Gin Arg Ala Leu Ser Ala Val Ser He Leu Thr He Thr Met 

515 520 525 

Gin Glu Gin Glu Lys Phe Gin Glu Pro Cys Phe Pro Cys Gly Lys Asn 

530 535 540 

Leu Ala Ser Lys Tyr Leu Val Trp Asp Cys Ser Pro Gin Trp Leu Cys 
545 550 555 560 

He Lys Lys Val Leu Arg Thr He Met Thr Asp Pro Phe Thr Glu Leu 

565 570 575 

Ala He Thr He Cys He He He Asn Thr Val Phe Leu Ala Val Glu 

580 585 590 

His His Asn Met Asp Asp Asn Leu Lys Thr He Leu Lys He Gly Asn 
595 600 605 
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Trp Val Phe Thr Gly lie Phe lie Ala Glu Met Cys Leu Lys lie He 

610 615 620 

Ala Leu Asp Pro Tyr His Tyr Phe Arg His Gly Trp Asn Val Phe Asp 
625 630 635 640 

Ser He Val Ala Leu Leu Ser Leu Ala Asp Val Leu Tyr Asn Thr Leu 

645 650 655 

Ser Asp Asn Asn Arg Ser Phe Leu Ala Ser Leu Arg Val Leu Arg Val 

660 665 670 

Phe Lys Leu Ala Lys Ser Trp Pro Thr Leu Asn Thr Leu He Lys He 

675 680 685 

He Gly His Ser Val Gly Ala Leu Gly Asn Leu Thr Val Val Leu Thr 

690 695 700 

He Val Val Phe He Phe Ser Val Val Gly Met Arg Leu Phe Gly Thr 

n 715 720 

705 710 

Lvs Phe Asn Lys Thr Ala Tyr Ala Thr Gin Glu Arg Pro Arg Arg Arg 

725 730 735 

Trp His Met Asp Asn Phe Tyr His Ser Phe Leu Val Val Phe Arg He 

740 745 750 

Leu Cys Gly Glu Trp He Glu Asn Met Trp Gly Cys Met Gin Asp Met 

755 760 765 

Asp Gly Ser Pro Leu Cys He He Val Phe Val Leu He Met Val He 

770 775 780 

Gly Lys Leu Val Val Leu Asn Leu Phe He Ala Leu Leu Leu Asn Ser 
785 790 795 800 

Phe Ser Asn Glu Glu Lys Asp Gly Ser Leu Glu Gly Glu Thr Arg Lys 

805 810 815 

Thr Lys Val Gin Leu Ala Leu Asp Arg Phe Arg Arg Ala Phe Ser Phe 

820 825 830 

Met Leu His Ala Leu Gin Ser Phe Cys Cys Lys Lys Cys Arg Arg Lys 

835 840 845 

Asn Ser Pro Lys Pro Lys Glu Thr Thr Glu Ser Phe Ala Gly Glu Asn 

850 855 860 

Lvs Asp Ser He Leu Pro Asp Ala Arg Pro Trp Lys Glu Tyr Asp Thr 
865 870 875 880 

Asp Met Ala Leu Tyr Thr Gly Gin Ala Gly Ala Pro Leu Ala Pro Leu 

885 890 895 

Ala Glu Val Glu Asp Asp Val Glu Tyr Cys Gly Glu Gly Gly Ala Leu 

900 905 910 

Pro Thr Ser Gin His Ser Ala Gly Val Gin Ala Gly Asp Leu Pro Pro 

915 920 925 

Glu Thr Lys Gin Leu Thr Ser Pro Asp Asp Gin Gly Val Glu Met Glu 

930 935 940 

Val Phe Ser Glu Glu Asp Leu His Leu Ser He Gin Ser Pro Arg Lys 

ocn 955 960 

945 95° 

Lvs Ser Asp Ala Val Ser Met Leu Ser Glu Cys Ser Thr He Asp Leu 

965 970 975 

Asn Asp He Phe Arg Asn Leu Gin Lys Thr Val Ser Pro Lys Lys Gin 

980 985 990 

Pro Asp Arg Cys Phe Pro Lys Gly Leu Ser Cys His Phe Leu Cys His 

995 1000 1005 

Lys Thr Asp Lys Arg Lys Ser Pro Trp Val Leu Trp Trp Asn He Arg 

1010 1015 1020 

Lys Thr Cys Tyr Gin He Val Lys His Ser Trp Phe Glu Ser Phe He 
1025 1030 1035 1040 
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lie Phe Val He Leu Leu Ser Ser Gly Ala Leu lie Phe Glu Asp Val 

1045 1050 1055 

Asn Leu Pro Ser Arg Pro Gin Val Glu Lys Leu Leu Arg Cys Thr Asp 

1060 1065 1070 

Asn lie Phe Thr Phe lie Phe Leu Leu Glu Met lie Leu Lys Trp Val 

1075 1080 1085 

Ala Phe Gly Phe Arg Arg Tyr Phe Thr Ser Ala Trp Cys Trp Leu Asp 

1090 1095 1100 

Phe Leu lie Val Val Val Ser Val Leu Ser Leu Met Asn Leu Pro Ser 

1105 mo 1115 1120 

Leu Lys Ser Phe Arg Thr Leu Arg Ala Leu Arg Pro Leu Arg Ala Leu 

1125 ll 30 1135 

Ser Gin Phe Glu Gly Met Lys Val Val Val Tyr Ala Leu lie Ser Ala 

1140 H45 1150 

lie Pro Ala lie Leu Asn Val Leu Leu Val Cys Leu lie Phe Trp Leu 

1155 1160 H65 

Val Phe Cys He Leu Gly Val Asn Leu Phe Ser Gly Lys Phe Gly Arg 

1170 H75 "BO 

Cys lie Asn Gly Thr Asp lie Asn Met Tyr Leu Asp Phe Thr Glu Val 
1185 H90 1195 1200 

Pro Asn Arg Ser Gin Cys Asn lie Ser Asn Tyr Ser Trp Lys Val Pro 

1205 1210 1215 

Gin Val Asn Phe Asp Asn Val Gly Asn Ala Tyr Leu Ala Leu Leu Gin 

1220 1225 1230 

Val Ala Thr Tyr Lys Gly Trp Leu Glu lie Met Asn Ala Ala Val Asp 

1235 !240 I 245 

Ser Arg Glu Lys Asp Glu Gin Pro Asp Phe Glu Ala Asn Leu Tyr Ala 

1250 1255 12 60 

Tyr Leu Tyr Phe Val Val Phe lie He Phe Gly Ser Phe Phe Thr Leu 
1265 1270 1275 1280 

Asn Leu Phe lie Gly Val lie lie Asp Asn Phe Asn Gin Gin Gin Lys 

1285 1290 12 95 

Lvs Leu Gly Gly Gin Asp lie Phe Met Thr Glu Glu Gin Lys Lys Tyr 

1300 1305 1310 

Tyr Asn Ala Met Lys Lys Leu Gly Thr Lys Lys Pro Gin Lys Pro He 

1315 1320 1325 

Pro Arg Pro Leu Asn Lys Cys Gin Ala Phe Val Phe Asp Leu Val Thr 

1330 1335 1340 

Ser Gin Val Phe Asp Val He He Leu Gly Leu He Val Leu Asn Met 
1345 1350 1355 1360 

He He Met Met Ala Glu Ser Ala Asp Gin Pro Lys Asp Val Lys Lys 

1365 1370 1375 

Thr Phe Asp He Leu Asn He Ala Phe Val Val He Phe Thr He Glu 

"BO "85 1390 

Cvs Leu He Lys Val Phe Ala Leu Arg Gin His Tyr Phe Thr Asn Gly 

1395 1400 1405 

Trp Asn Leu Phe Asp Cys Val Val Val Val Leu Ser He He Ser Thr 

1410 "IS 14 20 

Leu Val Ser Arg Leu Glu Asp Ser Asp He Ser Phe Pro Pro Thr Leu 
1425 1430 1435 1440 

Phe Arg Val Val Arg Leu Ala Arg He Gly Arg lie Leu Arg Leu Val 

1445 1450 1455 

Arg Ala Ala Arg Gly lie Arg Thr Leu Leu Phe Ala Leu Met Met Ser 
1460 1465 1470 
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Leu Pro Ser Leu Phe Asn lie Gly Leu Leu Leu Phe Leu Val Met Phe 

1475 1480 14 

lie Tyr Ala He Phe Gly Met Ser Trp Phe Ser ^Val Lys Lys Gly 

Ser lly 'lie Asp Asp lie Phe^Asn Phe Glu Thr^Phe Thr Gly Ser Met^ 

Leu'cys Leu Phe Gin Il^Thr Thr Ser Ala Gly Trp Asp Thr Leu Leu 

1"5 1530 1535 

Asn Pro Met Leu Glu Ala Lys Glu His Cys Asn Ser Ser Ser Gin Asp 

1540 1545 1550 

Ser Cys Gin Gin Pro Gin lie Ala Val Val Tyr Phe Val Ser Tyr lie 

1555 1560 1565 

lie lie Ser Phe Leu lie Val Val Asn Met Tyr lie Ala Val He Leu 

1570 1575 1580 

Glu Asn Phe Asn Thr Ala Thr Glu Glu Ser Glu Asp Pro Leu Gly Glu 
1585 1590 I 595 "°° 

Asp Asp Phe Glu He Phe Tyr Glu Val TjpOlu Lys Phe Asp ProGlu 

Ala Ser Gin Phe Ue'cin Tyr Ser Ala Leu Ser Asp Phe Ala Asp Ala 

1620 I 625 1630 

Leu Pro Glu Pro Leu Arg Val Ala Lys Pro Asn Lys Phe Gin Phe Leu 

1635 1640 I* 45 

Val Met Asp Leu Pro Met Val Met Gly Asp Arg Leu His Cys Met Asp 

1655 1660 
val Leu Phe Ala Phe Thr Thr Arg Val Leu Gly^Asp Ser Ser Gly Lev^ 

A 6 sp 5 Thr Met Lys Thr Me^Met Glu Glu Lys Phe Met Glu Ala Asn Pro 

1685 "90 1*95 

Phe Lys Lys Leu Tyr Glu Pro lie Val Thr Thr Thr Lys Arg Lys Glu 

1700 I 705 1710 

Glu Glu Gin Gly Ala Ala Val He Gin Arg Ala Tyr Arg Lys His Met 

^5 1720 1725 

Glu Lys Met Val Lys Leu Arg Leu Lys Asp Arg Ser Ser Ser Ser His 

17-?0 1735 I 740 

Gin Val Phe Cys Asn Gly Asp Leu Ser Ser Leu Asp Val Ala Lys Val 
1745 1750 I 7 * 5 

Lys Val His Asn Asp 
1765 

(4) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 856 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 
(vi) ORIGINAL SOURCE : 

(A) ORGANISM: human 

(F) TISSUE TYPE: Dorsal root ganglia 

(G) CELL TYPE: Peripheral nerve 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
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GCTGAGCAGT GGGGCACTGA TATTTGAAGA TGTTCACCTT GAGAACCAAC CCAAAATCCA 6 0 

AGAATTACTA AATTGTACTG ACATTATTTT TACACATATT TTTATCCTGG AGATGGTACT 120 
AAAATGGGTA GCCTTCGGAT TTGGAAAGTA TTTCACCAGT GCCTGGTGCT GCCTTGATTT 
CATCATTGTG ATTGTCTCTG TGACCACCCT CATTAACTTA ATGGAATTGA AGTCCTTCCG 
GACTCTACGA GCACTGAGGC CTCTTCGTGC GCTGTCCCAG TTTGAAGGAA TGAAGGTGGT 
GGTCAATGCT CTCATAGGTG CCATACCTGC CATTCTGAAT GTTTTGCTTG TCTGCCTCAT 
TTTCTGGCTC GTATTTTGTA TTCTGGGAGT ATACTTCTTT TCTGGAAAAT TTGGGAAATG 
CATTAATGGA ACAGACTCAG TTATAAATTA TACCATCATT ACAAATAAAA GTCAATGTGA 
AAGTGGCAAT TTCTCTTGGA TCAACCAGAA AGTCAACTTT GACAATGTGG GAAATGCTTA 
CCTCGCTCTG CTGCAAGTGG CAACATTTAA GGGCTGGATG GATATTATAT ATGCAGCTGT 
TGATTCCACA GAGAAAGAAC AACAGCCAGA GTTTGAGAGC AATTCACTCG GTTACATTTA 
CTTCGTAGTC TTTATCATCT TTGGCTCATT CTTCACTCTG AATCTCTTCA TTGGCGTTAT 
CATTGACAAC TTCAACCAAC AGCAGAAAAA GTTAGGTGGC CAAGACATTT TTATGACAGA 
AGAACAGAAG AAATACTATA ATGCAATGAA AAAATTAGGA TCCAAAAAAC CTCAAAAACC 
CATTCCACGG CCCGTT 

(5) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 701 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : RT-PCR 

(A) DESCRIPTION: /desc = DNA probe/domain IV 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: rat 

(F) TISSUE TYPE: dorsal root ganglia 

(G) CELL TYPE: peripheral nerve 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
856 



CTCAACATGG TTACGATGAT GGTGGAGACC GACGAGCAGG GCGAGGAGAA GACGAAGGTT 
CTGGGCAGAA TCAACCAGTT CTTTGTGGCC GTCTTCACGG GCGAGTGTGT GATGAAGATG 
TTCGCCCTGC GACAGTACTA TTTCACCAAC GGCTGGAACG TGTTCGACTT CATAGTGGTG 
ATCCTGTCCA TTGGGAGTCT GCTGTTTCTG CAATCCTTAA GTCACTGGAA AACTACTTCT 240 



60 
120 
180 
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CTTCCGGGTC ATCCGTCTGG 


CCAGGATCGG 


CCGCATCCTC 


AGGCTGATCC 


300 


r* n c r* a n r* P 3\ 


GGGGATTCGC 


ACGCTGCTCT 


TCGCCCTCAT 


GATGTCCCTG 


CCCGCCCTCT 


360 


1 ^PiJ\\—-r\ J- ^-VjVJ 


CCTCCTCCTC 


TTCCTCGTCA 


TGTTCATCTA 


CTCCATCTTC 


GGCATGGCCA 


420 




CGTCGTGGAC 


GAGGCCGGCA 


TCGACGACAT 


GTTCAACTTC 


AAGACCTTTG 


480 


f;rAACAGCAT 


d PTRTGC CTG 


TTCCAGATCA 


CCACCTCGGC 


CGGCTGGGAC 


GGCCTCCTCA 


540 


GCCCCATCCT 


CAACACGGGG 


CCTCCCTACT 


fiffiACCCCAA 


CCTGCCCAAC 


AGCAACGGCT 


600 


CCCGGGGGAA 


CTGCGGGAGC 


CCGGCGGTGG 


GCATCATCTT 


CTTCACCACC 


T AC AT CATC A 


660 


TCTCCTTCCT 


CATCGTGGTC 


AACATGTATA 


TCGCAGTCAT 


C 




701 



(5) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5334 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: RT-PCR 

(A) DESCRIPTION: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: 

(F) TISSUE TYPE: 

(G) CELL TYPE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 



GTCGACTCTA 


GATCAGGGTG AAGATGGAGG 


AGAGGTACTA 


CCCGGTGATC 


TTCCCGGACG 


60 


AGCGGAATTT 


CCGCCCCTTC ACTTCCGACT 


CTCTGGCTGC 


CATAGAGAAG 


CGGATTGCTA 


120 


TCCAAAAGGA 


GAGGAAGAAG TCCAAAGACA AGGCGGCAGC 


TGAGCCCCAG 


CCTCGGCCTC 


180 


AGCTTGACCT 


AAAGGCCTCC AGGAAGTTAC 


CTAAGCTTTA 


TGGTGACATT 


CCCCCTGAGC 


240 


TTGTAGCGAA 


GCCTCTGGAA GACCTGGACC 


CATTCTACAA AG AC C ATAAG 


ACATTCATGG 


300 


TGTTGAACAA 


GAAGAGAACA ATTTATCGCT 


TCAGCGCCAA 


GCGGGCCTTG 


TTCATTCTGG 


360 


GGCCTTTTAA 


TCCCCTCAGA AGCTTAATGA 


TTCGTATCTC 


TGTCCATTCA 


GTCTTTAGCA 


420 


TGTTCATCAT 


CTGCACGGTG ATCATCAACT 


GTATGTTCAT 


GGCGAATTCT 


ATGGAGAGAA 


480 


GTTTCGACAA 


CGACATTCCC GAATACGTCT 


TCATTGGGAT 


TTATATTTTA 


GAAGCTGTGA 


540 


TTAAAATATT 


GGCAAGAGGC TTCATTGTGG 


ATGAGTTTTC 


CTTCCTCCGA 


GATCCGTGGA 


600 
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ACTGGCTGGA CTTCATTGTC ATTGGAACAG CGATCGCAAC TTGTTTTCCG GGCAGCCAAG 660 

TCAATCTTTC AGCTCTTCGT ACCTTCCGAG TGTTCAGAGC TCTGAAGGCG ATTTCAGTTA 72 0 

TCTCAGGTCT GAAGGTCATC GTAGGTGCCC TGCTGCGCTC GGTGAAGAAG CTGGTAGACG 78 0 

TGATGGTCCT CACTCTCTTC TGCCTCAGCA TCTTTGCCCT GGTCGGTCAG CAGCTGTTCA 840 

TGGGAATTCT GAACCAGAAG TGTATTAAGC ACAACTGTGG CCCCAACCCT GCATCCAACA 900 

AGGATTGCTT TGAAAAGGAA AAAGATAGCG AAGACTTCAT AATGTGTGGT ACCTGGCTCG 960 

GCAGCAGACC CTGTCCCAAT GGTTCTACGT GCGATAAAAC CACATTGAAC CCAGACAATA 1020 

ATTATACAAA GTTTGACAAC TTTGGCTGGT CCTTTCTCGC CATGTTCCGG GTTATGACTC 1080 

AAGACTCCTG GGAGAGGCTT TACCGACAGA TCCTGCGGAC CTCTGGGATC TACTTTGTCT 114 0 

TCTTCTTCGT GGTGGTCATC TTCCTGGGCT CCTTCTACCT GCTTAACCTA ACCCTGGCTG 1200 

TTGTCACCAT GGCTTATGAA GAACAGAACA GAAATGTAGC TGCTGAGACA GAGGCCAAGG 126 0 

AGAAAATGTT TCAGGAAGCC CAGCAGCTGT TAAGGGAGGA GAAGGAGGCT CTGGTTGCCA 1320 

TGGGAATTGA CAGAAGTTCC CTTAATTCCC TTCAAGCTTC ATCCTTTTCC CCGAAGAAGA 13 80 

GGAAGTTTTT CGGTAGTAAG ACAAGAAAGT CCTTCTTTAT GAGAGGGTCC AAGACGGCCC 1440 

AAGCCTCAGC GTCTGATTCA GAGGACGATG CCTCTAAAAA TCCACAGCTC CTTGAGCAGA 1500 

CCAAACGACT GTCCCAGAAC TTGCCAGTGG ATCTCTTTGA TGAGCACGTG GACCCCCTCC 1560 

ACAGGCAGAG AGCGCTGAGC GCTGTCAGTA TCTTAACCAT CACCATGCAG GAACAAGAAA 162 0 

AATTCCAGGA GCCTTGTTTC CCATGTGGGA AAAATTTGGC CTCTAAGTAC CTGGTGTGGG 16 8 0 

ACTGTAGCCC TCAGTGGCTG TGCATAAAGA AGGTCCTGCG GACCATCATG ACGGATCCCT 1740 

TTACTGAGCT GGCCATCACC ATCTGCATCA TCATCAATAC CGTTTTCTTA GCCGTGGAGC 1800 

ACCACAACAT GGATGACAAC TTAAAGACCA TACTGAAAAT AGGAAACTGG GTTTTCACGG 1860 

GAATTTTCAT AGCGGAAATG TGTCTCAAGA TCATCGCGCT CGACCCTTAC CACTACTTCC 192 0 

GGCACGGCTG GAATGTTTTT GACAGCATCG TGGCCCTCCT GAGTCTCGCT GATGTGCTCT 1980 

ACAACACACT GTCTGATAAC AATAGGTCTT TCTTGGCTTC CCTCAGAGTG CTGAGGGTCT 2040 

TCAAGTTAGC CAAATCCTGG CCCACGTTAA ACACTCTCAT TAAGATCATC GGCCACTCCG 2100 

TGGGCGCGCT TGGAAACCTG ACTGTGGTCC TGACTATCGT GGTCTTCATC TTTTCTGTGG 2160 

TGGGCATGCG GCTCTTCGGC ACCAAGTTTA ACAAGACCGC CTACGCCACC CAGGAGCGGC 222 0 

CCAGGCGGCG CTGGCACATG GATAATTTCT ACCACTCCTT CCTGGTGGTG TTCCGCATCC 2280 
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TCTGTGGGGA ATGGATCGAG AACATGTGGG GCTGCATGCA GGATATGGAC GGCTCCCCGT 2340 
TGTGCATCAT TGTCTTTGTC CTGATAATGG TGATCGGGAA GCTTGTGGTG CTTAACCTCT 2400 
TCATTGCCTT GCTGCTCAAT TCCTTCAGCA ATGAGGAGAA GGATGGGAGC CTGGAAGGAG 2460 
AGACCAGGAA AACCAAAGTG CAGCTAGCCC TGGATCGGTT CCGCCGGGCC TTCTCCTTCA 2520 
TGCTGCACGC TCTTCAGAGT TTTTGTTGCA AGAAATGCAG GAGGAAAAAC TCGCCAAAGC 2580 
CAAAAGAGAC AACAGAAAGC TTTGCTGGTG AGAATAAAGA CTCAATCCTC CCGGATGCGA 2640 
GGCCCTGGAA GGAGTATGAT ACAGACATGG CTTTGTACAC TGGACAGGCC GGGGCTCCGC 2700 
TGGCCCCACT CGCAGAGGTA GAGGACGATG TGGAATATTG TGGTGAAGGC GGTGCCCTAC 2760 
CCACCTCACA ACATAGTGCT GGAGTTCAGG CCGGTGACCT CCCTCCAGAG ACCAAGCAGC 2820 
TCACTAGCCC GGATGACCAA GGGGTTGAAA TGGAAGTATT TTCTGAAGAA GATCTGCATT 2880 
TAAGCATACA GAGTCCTCGA AAGAAGTCTG ACGCAGTGAG CATGCTCTCG GAATGCAGCA 2940 
CAATTGACCT GAATGATATC TTTAGAAATT TACAGAAAAC AGTTTCCCCC AAAAAGCAGC 3000 
CAGATAGATG CTTTCCCAAG GGCCTTAGTT GTCACTTTCT ATGCCACAAA ACAGACAAGA 3060 
GAAAGTCCCC CTGGGTCCTG TGGTGGAACA TTCGGAAAAC CTGCTACCAA ATCGTGAAGC 3120 
ACAGCTGGTT TGAGAGTTTC ATAATCTTTG TTATTCTGCT GAGCAGTGGA GCGCTGATAT 3180 
TTGAAGATGT CAATCTCCCC AGCCGGCCCC AAGTTGAGAA ATTACTAAGG TGTACCGATA 3240 
ATATTTTCAC ATTTATTTTC CTCCTGGAAA TGATCCTGAA GTGGGTGGCC TTTGGATTCC 3300 
GGAGGTATTT CACCAGTGCC TGGTGCTGGC TTGATTTCCT CATTGTGGTG GTGTCTGTGC 3360 
TCAGTCTCAT GAATCTACCA AGCTTGAAGT CCTTCCGGAC TCTGCGGGCC CTGAGACCTC 3420 
TGCGGGCGCT GTCCCAGTTT GAAGGAATGA AGGTTGTCGT CTACGCCCTG ATCAGCGCCA 3480 
TACCTGCCAT TCTCAATGTC TTGCTGGTCT GCCTCATTTT CTGGCTCGTA TTTTGTATCT 3540 
TGGGAGTAAA TTTATTTTCT GGGAAGTTTG GAAGGTGCAT TAACGGGACA GACATAAATA 3600 
TGTATTTGGA TTTTACCGAA GTTCCGAACC GAAGCCAATG TAACATTAGT AATTACTCGT 3660 
GGAAGGTCCC GCAGGTCAAC TTTGACAACG TGGGGAATGC CTATCTCGCC CTGCTGCAAG 3720 
TGGCAACCTA TAAGGGCTGG CTGGAAATCA TGAATGCTGC TGTCGATTCC AGAGAGAAAG 3780 
ACGAGCAGCC GGACTTTGAG GCGAACCTCT ACGCGTATCT CTACTTTGTG GTTTTTATCA 3840 
TCTTCGGCTC CTTCTTTACC CTGAACCTCT TTATCGGTGT TATTATTGAC AACTTCAATC 3900 
AGCAGCAGAA AAAGTTAGGT GGCCAAGACA TCTTCATGAC TGAGGAGCAG AAGAAATATT 3960 
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ACAATGCAAT GAAAAAGTTA GGAACCAAGA AACCTCAAAA GCCCATCCCA AGGCCCCTGA 4 020 

ACAAATGTCA AGCCTTTGTG TTCGACCTGG TCACAAGCCA GGTCTTTGAC GTCATCATTC 4 080 

TGGGTCTTAT TGTCTTAAAT ATGATTATCA TGATGGCTGA ATCTGCCGAC CAGCCCAAAG 414 0 

ATGTGAAGAA AACCTTTGAT ATCCTCAACA TAGCCTTCGT GGTCATCTTT ACCATAGAGT 4200 

GTCTCATCAA AGTCTTTGCT TTGAGGCAAC ACTACTTCAC CAATGGCTGG AACTTATTTG 426 0 

ATTGTGTGGT CGTGGTTCTT TCTATCATTA GTACCCTGGT TTCCCGCTTG GAGGACAGTG 4320 

ACATTTCTTT CCCGCCCACG CTCTTCAGAG TCGTCCGCTT GGCTCGGATT GGTCGAATCC 43 8 0 

TCAGGCTGGT CCGGGCTGCC CGGGGAATCA GGACCCTCCT CTTTGCTTTG ATGATGTCTC 4440 

TCCCCTCTCT CTTCAACATC GGTCTGCTGC TCTTCCTGGT GATGTTCATT TACGCCATCT 4 500 

TTGGGATGAG CTGGTTTTCC AAAGTGAAGA AGGGCTCCGG GATCGACGAC ATCTTCAACT 4 560 

TCGAGACCTT TACGGGCAGC ATGCTGTGCC TCTTCCAGAT AACCACTTCG GCTGGCTGGG 462 0 

ATACCCTCCT CAACCCCATG CTGGAGGCAA AAGAACACTG CAACTCCTCC TCCCAAGACA 46 8 0 

GCTGTCAGCA GCCGCAGATA GCCGTCGTCT ACTTCGTCAG TTACATCATC ATCTCCTTCC 474 0 

TCATCGTGGT CAACATGTAC ATCGCTGTGA TCCTCGAGAA CTTCAACACA GCCACGGAGG 4800 

AGAGCGAGGA CCCTCTGGGA GAGGACGACT TTGAAATCTT CTATGAGGTC TGGGAGAAGT 486 0 

TTGACCCCGA GGCGTCGCAG TTCATCCAGT ATTCGGCCCT CTCTGACTTT GCGGACGCCC 4 920 

TGCCGGAGCC GTTGCGTGTG GCCAAGCCGA ATAAGTTTCA GTTTCTAGTG ATGGACTTGC 4 980 

CCATGGTGAT GGGCGACCGC CTCCATTGCA TGGATGTTCT CTTTGCTTTC ACTACCAGGG 5040 

TCCTCGGGGA CTCCAGCGGC TTGGATACCA TGAAAACCAT GATGGAGGAG AAGTTTATGG 5100 

AGGCCAACCC TTTTAAGAAG CTCTACGAGC CCATAGTCAC CACCACCAAG AGGAAGGAGG 5160 

AGGAGCAAGG CGCCGCCGTC ATCCAGAGGG CCTACCGGAA ACACATGGAG AAGATGGTCA 5220 

AACTGAGGCT GAAGGACAGG TCAAGTTCAT CGCACCAGGT GTTTTGCAAT GGAGACTTGT 5280 

CCAGCTTGGA TGTGGCCAAG GTCAAGGTTC ACAATGACTG AACCCTCATC TAGA 53 34 
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CLAIMS 



What is claimed is: 

1 . An isolated DNA sequence comprising the nucleotide sequence set forth in SEQ ID NO: 1 
or SEQ ID NO:3. 

2. The DNA of Claim 1 wherein said DNA sequence is encoding a sodium channel protein 
or fragment thereof. 

3. The DNA of Claim 2 wherein said sodium channel protein is the a-subunit or fragment 
thereof. 

4. The DNA of Claim 3 wherein said sodium channel protein is tetrodotoxin-resistant. 

5. The DNA of Claim 3 or 4 wherein said sodium channel protein is found in mammals. 

6. The DNA of Claim 3 or 4 wherein said sodium channel protein is found in rat. 

7. The DNA of Claim 3 or 4 wherein said sodium channel protein is found in human. 

8. The DNA of Claim 1 wherein said DNA is cDNA. 

9. The DNA of Claim 1 wherein said DNA is synthetic DNA. 

10. Expression vectors comprising the DNA of Claim 8. 

1 1 . Expression vectors comprising the synthetic DNA of Claim 9. 

12. Host cells transformed with the expression vectors of Claim 10. 

1 3 . Host cells transformed with the expression vectors of Claim 1 1 . 

14. A recombinant polynucleotide comprising a nucleic acid sequence derived from the DNA 
sequence of Claim 1. 

15. A sodium channel protein encoded by a DNA of Claims 1 to 9 or allelic variants thereof. 

16. A tetrodotoxin-resistant sodium channel protein encoded by a DNA of Claims 1 to 9 or 
allelic variants thereof. 

17. The protein of Claim 16 having the amino acid sequence set forth in SEQ ID NO:2. 

18. A method for identifying inhibitors of tetrodotoxin-resistant sodium channel protein 
comprising contacting a compound suspected of being said inhibitor with sodium channel 
protein of claim 16 and measuring the activity of said expressed sodium channel protein. 

19. Poly- and/or monoclonal antibodies raised against a tetrodotoxin-resistant sodium channel 
protein encoded by a DNA of Claims 1 to 9 or allelic variants thereof. 

20. A diagnostic kit comprising a polynucleotide of claim 14 capable of specifically 
hybridizing to a tetrodotoxin-resistant sodium channel protein or fragment thereof. 

21. The use of an isolated DNA sequence of Claims 1 to 9 for identifying a compound 
suspected of being an inhibitor of tetrodotoxin-resistant sodium channel protein. 

22. The invention substantially as hereinbefore described especially with reference to the 
foregoing Examples. 
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Figure 1A: SEQ ID N0:1 
1 GAAGTCACAG GAGTGTCTGT CAGCGAGAGG AAGAAGGGAG AGTTTACTGA 

51 GTGTCTTCTG CCCCTCCTCA GGGTGAAGAT_GGAGGAGAGG TACTACCCGG 

101 TGATCTTCCC GGACGAGCGG AATTTCCGCC CCTTCACTTC CGACTCTCTG 

151 GCTGCCATAG AGAAGCGGAT TGCTATCCAA AAGGAGAGGA AGAAGTCCAA 

201 AGACAAGGCG GCAGCTGAGC CCCAGCCTCG GCCTCAGCTT GACCTAAAGG 

251 CCTCCAGGAA GTTACCTAAG CTTTATGGTG ACATTCCCCC TGAGCTTGTA 

301 GCGAAGCCTC TGGAAGACCT GGACCCATTC TACAAAGACC ATAAGACATT 

351 CATGGTGTTG AACAAGAAGA GAACAATTTA TCGCTTCAGC GCCAAGCGGG 

401 CCTTGTTCAT TCTGGGGCCT TTTAATCCCC TCAGAAGCTT AATGATTCGT 

451 ATCTCTGTCC ATTCAGTCTT TAGCATGTTC ATCATCTGCA CGGTGATCAT 

501 CAACTGTATG TTCATGGCGA ATTCTATGGA GAGAAGTTTC GACAACGACA 

551 TTCCCGAATA CGTCTTCATT GGGATTTATA TTTTAGAAGC TGTGATTAAA 

601 ATATTGGCAA GAGGCTTCAT TGTGGATGAG TTTTCCTTCC TCCGAGATCC 

651 GTGGAACTGG CTGGACTTCA TTGTCATTGG AACAGCGATC GCAACTTGTT 

701 TTCCGGGCAG CCAAGTCAAT CTTTCAGCTC TTCGTACCTT CCGAGTGTTC 

751 AGAGCTCTGA AGGCGATTTC AGTTATCTCA GGTCTGAAGG TCATCGTAGG 

801 TGCCCTGCTG CGCTCGGTGA AGAAGCTGGT AGACGTGATG GTCCTCACTC 

851 TCTTCTGCCT CAGCATCTTT GCCCTGGTCG GTCAGCAGCT GTTCATGGGA 

901 ATTCTGAACC AGAAGTGTAT TAAGCACAAC TGTGGCCCCA ACCCTGCATC 

951 CAACAAGGAT TGCTTTGAAA AGGAAAAAGA TAGCGAAGAC TTCATAATGT 

1001 GTGGTACCTG GCTCGGCAGC AGACCCTGTC CCAATGGTTC TACGTGCGAT 

1051 AAAACCACAT TGAACCCAGA CAATAATTAT ACAAAGTTTG ACAACTTTGG 

1101 CTGGTCCTTT CTCGCCATGT TCCGGGTTAT GACTCAAGAC TCCTGGGAGA 

1151 GGCTTTACCG ACAGATCCTG CGGACCTCTG GGATCTACTT TGTCTTCTTC 

1201 TTCGTGGTGG TCATCTTCCT GGGCTCCTTC TACCTGCTTA ACCTAACCCT 



Figure IB: SEQIDN0:1 
1251 GGCTGTTGTC ACCATGGCTT ATGAAGAACA GAACAGAAAT GTAGCTGCTG 

1301 AGACAGAGGC CAAGGAGAAA ATGTTTCAGG AAGCCCAGCA GCTGTTAAGG 

1351 GAGGAGAAGG AGGCTCTGGT TGCCATGGGA ATTGACAGAA GTTCCCTTAA 

1401 TTCCCTTCAA GCTTCATCCT TTTCCCCGAA GAAGAGGAAG TTTTTCGGTA 

1451 GTAAGACAAG AAAGTCCTTC TTTATGAGAG GGTCCAAGAC GGCCCAAGCC 

1501 TCAGCGTCTG ATTCAGAGGA CGATGCCTCT AAAAATCCAC AGCTCCTTGA 

1551 GCAGACCAAA CGACTGTCCC AGAACTTGCC AGTGGATCTC TTTGATGAGC 

1601 ACGTGGACCC CCTCCACAGG CAGAGAGCGC TGAGCGCTGT CAGTATCTTA 

1651 ACCATCACCA TGCAGGAACA AGAAAAATTC CAGGAGCCTT GTTTCCCATG 

1701 TGGGAAAAAT TTGGCCTCTA AGTACCTGGT GTGGGACTGT AGCCCTCAGT 

1751 GGCTGTGCAT AAAGAAGGTC CTGCGGACCA TCATGACGGA TCCCTTTACT 

1801 GAGCTGGCCA TCACCATCTG CATCATCATC AATACCGTTT TCTTAGCCGT 

1851 GGAGCACCAC AACATGGATG ACAACTTAAA GACCATACTG AAAATAGGAA 

1901 ACTGGGTTTT CACGGGAATT TTCATAGCGG AAATGTGTCT CAAGATCATC 

1951 GCGCTCGACC CTTACCACTA CTTCCGGCAC GGCTGGAATG TTTTTGACAG 

2001 CATCGTGGCC CTCCTGAGTC TCGCTGATGT GCTCTACAAC ACACTGTCTG 

2051 ATAACAATAG GTCTTTCTTG GCTTCCCTCA GAGTGCTGAG GGTCTTCAAG 

2101 TTAGCCAAAT CCTGGCCCAC GTTAAACACT CTCATTAAGA TCATCGGCCA 

2151 CTCCGTGGGC GCGCTTGGAA ACCTGACTGT GGTCCTGACT ATCGTGGTCT 

2201 TCATCTTTTC TGTGGTGGGC ATGCGGCTCT TCGGCACCAA GTTTAACAAG 

2251 ACCGCCTACG CCACCCAGGA GCGGCCCAGG CGGCGCTGGC ACATGGATAA 

2301 TTTCTACCAC TCCTTCCTGG TGGTGTTCCG CATCCTCTGT GGGGAATGGA 

2351 TCGAGAACAT GTGGGGCTGC ATGCAGGATA TGGACGGCTC CCCGTTGTGC 

2401 ATCATTGTCT TTGTCCTGAT AATGGTGATC GGGAAGCTTG TGGTGCTTAA 
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Figure 1C:SEQ ID N0:1 
2451 CCTCTTCATT GCCTTGCTGC TCAATTCCTT CAGCAATGAG GAGAAGGATG 
2501 GGAGCCTGGA AGGAGAGACC AGGAAAACCA AAGTGCAGCT AGCCCTGGAT 
2551 CGGTTCCGCC GGGCCTTCTC CTTCATGCTG CACGCTCTTC AGAGTTTTTG 
2601 TTGCAAGAAA TGCAGGAGGA AAAACTCGCC AAAGCCAAAA GAGACAACAG 
2651 AAAGCTTTGC TGGTGAGAAT AAAGACTCAA TCCTCCCGGA TGCGAGGCCC 
2,01 TGGAAGGAGT ATGATACAGA CATGGCTTTG TACACTGGAC AGGCCGGGGC 
2751 TCCGCTCGCC CCACTCGCAG AGGTAGAGGA CGATGTGGAA TATTGTGGTG 
2801 AAGGCGGTGC CCTAOCCACC TCACAACATA GTGCTGGAGT TCAGGCCGGT 
2851 GACCTCCCTC CAGAGACCAA GCAGCTCACT AGCCCGGATG ACCAAGGGGT 
2901 TGAAATGGAA GTATTTTCTG AAGAAGATCT GCATTTAAGC ATACAGAGTC 
2951 CTCGAAAGAA GTCTGACGCA GTGAGCATGC TCTCGGAATG CAGCACAATT 
3001 GACCTGAATG ATATCTTTAG AAATTTACAG AAAACAGTTT CCCCCAAAAA 
3051 GCAGCCAGAT AGATGCTTTC CCAAGGGCCT TAGTTGTCAC TTTCTATGCC 
3101 ACAAAACAGA CAAGAGAAAG TCCCCCTGGG TCCTGTGGTG GAACATTCGG 
3151 AAAACCTGCT ACCAAATCGT GAAGCACAGC TGGTTTGAGA GTTTCATAAT 
3201 CTTTGTTATT CTGCTGAGCA GTGGAGCGCT GATATTTGAA GATGTCAATC 
3251 TCCCCAGCCG GCCCCAAGTT GAGAAATTAC TAAGGTGTAC CGATAATATT 
3301 TTCACATTTA TTTTCCTCCT GGAAATGATC CTGAAGTGGG TGGCCTTTGG 
3351 ATTCCGGAGG TATTTCACCA GTGCCTGGTG CTGGCTTGAT TTCCTCATTG 
3401 TGGTGGTGTC TGTGCTCAGT CTCATGAATC TACCAAGCTT GAAGTCCTTC 
3451 CGGACTCTGC GGGCCCTGAG ACCTCTGCGG GCGCTGTCCC AGTTTGAAGG 
3501 AATGAAGGTT GTCGTCTACG CCCTGATCAG CGCCATACCT GCCATTCTCA 
3551 ATGTCTTGCT GGTCTGCCTC ATTTTCTGGC TCGTATTTTG TATCTTGGGA 
3601 GTAAATTTAT TTTCTGGGAA GTTTGGAAGG TGCATTAACG GGACAGACAT 



Figure ID: SEQIDN0:1 
3651 AAATATGTAT TTGGATTTTA CCGAAGTTCC GAACCGAAGC CAATGTAACA 
3701 TTAGTAATTA CTCGTGGAAG GTCCCGCAGG TCAACTTTGA CAACGTGGGG 
375! AATGCCTATC TCGCCCTGCT GCAAGTGGCA ACCTATAAGG GCTGGCTGGA 
3801 AATCATGAAT GCTGCTGTCG ATTCCAGAGA GAAAGACGAG CAGCCGGACT 
3851 TTGAGGCGAA CCTCTACGCG TATCTCTACT TTGTGGTTTT TATCATCTTC 
3901 GGCTCCTTCT TTACCCTGAA CCTCTTTATC GGTGTTATTA TTGACAACTT 
3951 CAATCAGCAG CAGAAAAAGT TAGGTGGCCA AGACATTTTT ATGACAGAAG 
4001 AACAGAAGAA ATATTACAAT GCAATGAAAA AGTTAGGAAC CAAGAAACCT 
4051 CAAAAGCCCA TCCCAAGGCC CCTGAACAAA TGTCAAGCCT TTGTGTTCGA 
4101 CCTGGTCACA AGCCAGGTCT TTGACGTCAT CATTCTGGGT CTTATTGTCT 
4151 TAAATATGAT TATCATGATG GCTGAATCTG CCGACCAGCC CAAAGATGTG 
4201 AAGAAAACCT TTGATATCCT CAACATAGCC TTCGTGGTCA TCTTTACCAT 
4251 AGAGTGTCTC ATCAAAGTCT TTGCTTTGAG GCAACACTAC TTCACCAATG 
4301 GCTGGAACTT ATTTGATTGT GTGGTCGTGG TTCTTTCTAT CATTAGTACC 
4351 CTGGTTTCCC GCTTGGAGGA CAGTGACATT TCTTTCCCGC CCACGCTCTT 
4401 CAGAGTCGTC CGCTTGGCTC GGATTGGTCG AATCCTCAGG CTGGTCCGGG 
4451 CTGCCCGGGG AATCAGGACC CTCCTCTTTG CTTTGATGAT GTCTCTCCCC 
4501 TCTCTCTTCA ACATCGGTCT GCTGCTCTTC CTGGTGATGT TCATTTACGC 
4551 CATCTTTGGG ATGAGCTGGT TTTCCAAAGT GAAGAAGGGC TCCGGGATCG 
4601 ACGACATCTT CAACTTCGAG ACCTTTACGG GCAGCATGCT GTGCCTCTTC 
4651 CAGATAACCA CTTCGGCTGG CTGGGATACC CTCCTCAACC CCATGCTGGA 
4701 GGCAAAAGAA CACTGCAACT CCTCCTCCCA AGACAGCTGT CAGCAGCCGC 
4751 AGATAGCCGT CGTCTACTTC GTCAGTTACA TCATCATCTC CTTCCTCATC 
4801 GTGGTCAACA TGTACATCGC TGTGATCCTC GAGAACTTCA ACACAGCCAC 



Figure IE: SEQ ID NO: 1 
4851 GGAGGAGAGC GAGGACCCTC TGGGAGAGGA CGACTTTGAA ATCTTCTATG 

4901 AGGTCTGGGA GAAGTTTGAC CCCGAGGCGT CGCAGTTCAT CCAGTATTCG 
4951 GCCCTCTCTG ACTTTGCGGA CGCCCTGCCG GAGCCGTTGC GTGTGGCCAA 
5001 GCCGAATAAG TTTCAGTTTC TAGTGATGGA CTTGCCCATG GTGATGGGCG 
5051 ACCGCCTCCA TTGCATGGAT GTTCTCTTTG CTTTCACTAC CAGGGTCCTC 
5101 GGGGACTCCA GCGGCTTGGA TACCATGAAA ACCATGATGG AGGAGAAGTT 
5151 TATGGAGGCC AACCCTTTTA AGAAGCTCTA CGAGCCCATA GTCACCACCA 
5201 CCAAGAGGAA GGAGGAGGAG CAAGGCGCCG CCGTCATCCA GAGGGCCTAC 
5251 CGGAAACACA TGGAGAAGAT GGTCAAACTG AGGCTGAAGG ACAGGTCAAG 
5301 TTCATCGCAC CAGGTGTTTT GCAATGGAGA CTTGTCCAGC TTGGATGTGG 
5351 CCAAGGTCAA GGTTCACAAT GACTGAACCC TCATCTCCAC CCCTACCTCA 
5401 CTGCCTCACA GCTTAGCCTC CAGCCTCTGG CGAGCAGGCG GCAGACTCAC 
5451 TGAACACAGG CCGTTCGATC TGTGTTTTTG GCTGAACGAG GTGACAGGTT 
5501 GGCGTCCATT TTTAAATGAC TCTTGGAAAG ATTTCATGTA GAGAGATGTT 
5551 AGAAGGGACT GCAAAGGACA CCGACCATAA CGGAAGGCCT GGAGGACAGT 
5601 CCAACTTACA TAAAGATGAG AAACAAGAAG GAAAGATCCC AGGAAAACTT 
5651 CAGATTGTGT TCTCAGTACA TCCCCCAATG TGTCTGTTCG GTGTTTTGAG 
5701 TATGTGACCT GCCACATGTA GCTCTTTTTT GCATGTACGT CAAAACCCTG 
57 51 CAGTAAGTTG ATAGCTTGCT ACGGGTGTTC CTACCAGCAT CACAGAATTG 
5801 GGTGTATGAC TCAAACCTAA AAGCATGACT CTGACTTGTC AGTCAGCACC 
5851 CCGACTTTCA GACGCTCCAA TCTCTGTCCC AGGTGTCTAA CGAATAAATA 
5901 GGTAAAAG 



Figure 2A: SEQ ID NO: 2 
Met Glu Glu Arg Tyr Tyr Pro Val lie Phe Pro Asp Glu Arg Asn Phe 

Arg Pro Phe Thr Ser Asp Ser Leu Ala Ala He Glu Lys Arg He Ala 

20 " 

lie Gin Lys Glu Arg Lys Lys Ser Lys Asp Lys Ala Ala Ala Glu Pro 



35 

Gin Pro Arg Pro 
50 



40 



45 



Gin Leu Asp Leu Lys Ala Ser Arg Lys Leu Pro Lys 



55 



60 



Leu Tyr Gly Asp He Pro Pro Glu Leu Val Ala Lys Pro Leu Glu Asp 

65 70 " 80 

l1 Asp Pro Phe Tyr Lys Asp His Lys Thr Phe Met Val Leu Asn Lys 



85 



90 



95 



L ys Arg Thr lie Tyr Arg Phe Ser Ala Lys Arg Ala Leu Phe He Leu 
100 

Gly Pro Phe Asn Pro Leu Arg Ser Leu Met He Arg lie Ser Val His 



115 



120 



125 



Ser Val Phe Ser Met Phe He He Cys Thr Val He He Asn Cys Met 



130 



135 



140 



Phe Met Ala Asn Ser Met Glu Arg Ser Phe Asp Asn Asp He Pro Glu 

150 155 "0 

145 150 

Tyr val Phe He Gly He Tyr He Leu Glu Ala Val He Lys He Leu 



165 



170 



175 



Ala Arg 



Gly Phe He Val Asp Glu Phe Ser Phe Leu Arg Asp Pro Trp 



180 



185 



190 



Asn Trp Leu Asp Phe lie Val He Gly Thr Ala He Ala Thr Cys Phe 



195 



200 



205 



Pro Gly Ser Gin Val Asn Leu Ser Ala Leu Arg Thr Phe Arg Val Phe 



210 



215 



220 



Arg Ala Leu Lys Ala He Ser Val He Ser Gly Leu Lys Val lie Val 



225 



230 



235 



240 



Gly Ala Leu Leu Arg Ser Val Lys Lys Leu Val Asp Val Met Val Leu 



245 



250 



255 



Thr Leu Phe Cys Leu Ser He Phe Ala Leu Val Gly Gin Gin Leu Phe 



260 

Met Gly He Leu Asn 
275 



265 



270 



Gin Lys Cys He Lys His Asn Cys Gly Pro Asn 



280 



285 



295 300 

-si c 320 

305 JiU 



Qowiin f9 Sltalkif & JJenJerto* 



Figure 2B: SEQIDNO: 2 
Ser Thr Cys Asp Lys Thr Thr Leu Asn Pro Asp Asn Asn Tyr Thr Lys 

325 330 335 

Phe Asp Asn Phe Gly Trp Ser Phe Leu Ala Met Phe Arg Val Met Thr 

340 345 350 

Gin Asp Ser Trp Glu Arg Leu Tyr Arg Gin lie Leu Arg Thr Ser Gly 

355 360 365 

He Tyr Phe Val Phe Phe Phe Val Val Val He Phe Leu Gly Ser Phe 

370 375 380 

Tyr Leu Leu Asn Leu Thr Leu Ala Val Val Thr Met Ala Tyr Glu Glu 

"inc 400 
385 390 395 

Gin Asn Arg Asn Val Ala Ala Glu Thr Glu Ala Lys Glu Lys Met Phe 



405 410 415 



Gin Glu Ala Gin Gin Leu Leu Arg Glu Glu Lys Glu Ala Leu Val Ala 

420 425 430 

Met Gly lie Asp Arg Ser Ser Leu Asn Ser Leu Gin Ala Ser Ser Phe 

435 440 445 

Ser Pro Lys Lys Arg Lys Phe Phe Gly Ser Lys Thr Arg Lys Ser Phe 

450 455 460 

Phe Met Arg Gly Ser Lys Thr Ala Gin Ala Ser Ala Ser Asp Ser Glu 

470 

Asp Asp Ala Ser Lys Asn Pro Gin Leu Leu Glu Gin Thr Lys Arg Leu 

485 490 495 

Ser Gin Asn Leu Pro Val Asp Leu Phe Asp Glu His Val Asp Pro Leu 

500 505 510 

His Arg Gin Arg Ala Leu Ser Ala Val Ser lie Leu Thr lie Thr Met 

515 520 525 

Gin Glu Gin Glu Lys Phe Gin Glu Pro Cys Phe Pro Cys Gly Lys Asn 

530 535 540 

Leu Ala Ser Lys Tyr Leu Val Trp Asp Cys Ser Pro Gin Trp Leu Cys 
545 550 555 560 

lie Lys Lys Val Leu Arg Thr lie Met Thr Asp Pro Phe Thr Glu Leu 

565 570 575 

Ala lie Thr lie Cys lie lie lie Asn Thr Val Phe Leu Ala Val Glu 

580 585 590 

His His Asn Met Asp Asp Asn Leu Lys Thr lie Leu Lys lie Gly Asn 



595 600 605 

Trp Val Phe Thr Gly He Phe He Ala Glu Met Cys Leu Lys lie lie 
610 615 620 



Figure 2C: SEQ ID NO: 2 



Ala Leu Asp Pro Tyr His Tyr Phe Arg His Gly Trp Asn Val Phe Asp 

630 635 640 

Ser lie Val Ala Leu Leu Ser Leu Ala Asp Val Leu Tyr Asn Thr Leu 
645 

Ser Asp Asn Asn Arg Ser Phe Leu Ala Ser Leu Arg Val Leu Arg Val 

660 665 670 

Phe Lys Leu Ala Lys Ser Trp Pro Thr Leu Asn Thr Leu lie Lys He 

675 680 685 

He Gly His Ser Val Gly Ala Leu Gly Asn Leu Thr Val Val Leu Thr 

690 695 700 

lie val Val Phe lie Phe Ser Val Val Gly Met Arg Leu Phe Gly Thr 

710 715 720 

705 710 

Lys Phe Asn Lys Thr Ala Tyr Ala Thr Gin Glu Arg Pro Arg Arg Arg 
725 

Trp His Met Asp Asn 



730 



735 



Phe Tyr His Ser Phe Leu Val Val Phe Arg He 



740 



745 



750 



Leu Cys 



Gly Glu Trp He Glu Asn Met Trp Gly Cys Met Gin Asp Met 



755 



760 



765 



Asp Gly Ser Pro Leu Cys He lie Val Phe Val Leu He Met Val lie 

770 77 5 780 

Gly Lys Leu Val Val Leu Asn Leu Phe He Ala Leu Leu Leu Asn Ser 

785 790 795 800 

Phe Ser Asn Glu Glu Lys Asp Gly Ser Leu Glu Gly Glu Thr Arg Lys 

805 810 815 

Thr Lys Val Gin Leu Ala Leu Asp Arg Phe Arg Arg Ala Phe Ser Phe 

820 825 830 

Met Leu His Ala Leu Gin Ser Phe Cys Cys Lys Lys Cys Arg Arg Lys 

835 840 845 

Asn Ser Pro Lys Pro Lys Glu Thr Thr Glu Ser Phe Ala Gly Glu Asn 

850 855 860 

Lys Asp Ser He Leu Pro Asp Ala Arg Pro Trp Lys Glu Tyr Asp Thr 
865 870 875 880 

Asp Met Ala Leu Tyr Thr Gly Gin Ala Gly Ala Pro Leu Ala Pro Leu 
885 890 



895 



Al. Glu Val Glu Asp Asp Val Glu Tyr Cys Gly Glu Gly Gly Ala Leu 
900 905 910 



Figure 2D: SEQ ED NO: 2 

915 920 925 

935 940 
Val Phi Ser Clu Olu Asp Leu His Leu Ser lie Gin Ser Pro Arg Lys 

945 950 955 

965 970 

"° 

995 1000 10° 5 

1010 

Lys Thr Cys Tyr Gin He Val Lys His Ser Trp Phe Glu Ser Phe He 
1025 

He Phe Val lie Leu Leu Ser Ser Gly Ala Leu He Phe Glu Asp Val 



1025 

Val lie Leu Leu Ser Ser Gly Ala Leu He Phe Glu t 

1055 

1045 J -"" JV 



1060 



1065 



1070 



Asn He Phe Thr Phe He Phe Leu Leu Glu Met He Leu Lys Trp Val 



1075 



1080 



1085 



Ala Phe Gly Phe Arg Ar 9 Tyr Phe Thr Ser Ala Trp Cys Trp Leu Asp 

1090 ™* 
Phe Leu lie Val Val Val Ser Val Leu Ser Leu Met Asn Leu Pro Ser 

; 120 

L eu Lys Ser Phe Arg Thr Leu Arg Ala Leu Arg Pro Leu Arg Ala Leu 



1125 



1130 



1135 



Ser Gin Phe Glu Gly Met Lys Val Val Val Tyr Ala Leu He Ser Ala 

1150 

He Pro Ala He Leu Asn Val Leu Leu Val Cys Leu He Phe Trp Leu 



1155 



1160 



1165 



Val Phe Cys He Leu Gly Val Asn Leu Phe Ser Gly Lys Phe Gly Arg 

1175 

He Asn Met Tyr Leu Asp Phe Thr Glu Val 



Cys He Asn Gly Thr Asp 



1200 



185 1190 1195 

Pro Asn Arg Ser Gin Cys Asn lie Ser Asn Tyr Ser Trp Lys Val Pro 
1205 1210 "IS 



/OA O * <° ' ' ' 



Figure 2E: SEQIDNO: 2 



Gin Val Asn Phe Asp Asn Val Gly Asn Ala Tyr Leu Ala Leu Leu Gin 



1220 



1225 1230 



Val 



Ala Thr Tyr Lys Gly Trp Leu Glu lie Met Asn Ala Ala Val Asp 



1235 



1240 1245 



Ser Arg 



Glu Lys Asp Glu Gin Pro Asp Phe Glu Ala Asn Leu Tyr Ala 



1255 1260 



1250 

Tyr Leu Tyr Phe Val Val Phe He lie Phe Gly Ser Phe Phe Thr Leu 
1265 

Asn Leu Phe lie Gly Val lie lie Asp Asn Phe Asn Gin Gin Gin Lys 

1285 1290 1295 

Lys Leu Gly Gly Gin Asp He Phe Met Thr Glu Glu Gin Lys Lys Tyr 

1300 1305 1310 

Tyr Asn Ala Met Lys Lys Leu Gly Thr Lys Lys Pro Gin Lys Pro lie 

1315 1320 1325 

Pro Arg Pro Leu Asn Lys Cys Gin Ala Phe Val Phe Asp Leu Val Thr 

1330 1335 1340 

Ser Gin Val Phe Asp Val He lie Leu Gly Leu lie Val Leu Asn Met 
1345 1350 1355 1360 

lie lie Met Met Ala Glu Ser Ala Asp Gin Pro Lys Asp Val Lys Lys 

1365 1370 1375 

Thr Phe Asp lie Leu Asn lie Ala Phe Val Val lie Phe Thr lie Glu 

1380 "85 1390 

Cys Leu lie Lys Val Phe Ala Leu Arg Gin His Tyr Phe Thr Asn Gly 

1395 1400 I 405 

Trp Asn Leu Phe Asp Cys Val Val Val Val Leu Ser lie lie Ser Thr 

1410 1415 I 420 

Leu Val Ser Arg Leu Glu Asp Ser Asp lie Ser Phe Pro Pro Thr Leu 
1425 1430 1435 144C 

Phe Arg Val Val Arg Leu Ala Arg lie Gly Arg He Leu Arg Leu Val 

1445 1450 I 455 

Arg Ala Ala Arg Gly He Arg Thr Leu Leu Phe Ala Leu Met Met Ser 

1460 1465 1470 

Leu Pro Ser Leu Phe Asn He Gly Leu Leu Leu Phe Leu Val Met Phe 

1475 1480 1485 

He Tyr Ala He Phe Gly Met Ser Trp Phe Ser Lys Val Lys Lys Gly 



(I. O 1 r> 



1495 1500 
£er "ly lie Asp MP ne Phe »sn Phe Olu Thr Ph. Thr 01, ser Het 

1515 1520 



1505 



1510 



Figure 2F: SEQ ED NO: 2 

1525 "3° 1535 

-, T uic rv? Asn Ser Ser Ser Gin Asp 

Asn Pro Met Leu Glu Ala Lys Glu His Cys Asn 

1540 

Ser Cys Gin Gin Pro Gin He Ala Val Val Tyr Phe Val Ser Tyr He 

1555 1560 I 565 

He lie Ser Phe Leu lie Val Val Asn Met Tyr He Ala Val He Leu 

1575 1580 
Glu Asn 0 phe Asn Thr Ala Thr Glu Glu Ser Glu Asp Pro Leu Gly Glu 

85 1590 I 595 1600 

Asp'asp Phe Glu lie Phe Tyr Glu Val Trp Glu Lys Phe Asp Pro Glu 

1605 1615 
Ala Ser Gin Phe He Gin Tyr Ser Ala Leu Ser Asp Phe Ala Asp Ala 



1620 



1625 



1630 



L eu Pro Glu Pro Leu Arg Val Ala Lys Pro Asn Lys Phe Gin Phe Leu 



1635 



1640 



1645 



Val Met Asp Leu Pro Met Val Met Gly Asp Arg Leu His Cys Met Asp 



1650 



1655 



1660 



Val Leu Phe Ala Phe Thr Thr Arg Val Leu Gly Asp Ser Ser Gly Leu 



1665 



1670 



1675 



1680 



ASP Thr Met Lys Thr Met Met Glu Glu Lys Phe Met Glu Ala Asn Pro 

1685 1690 16 95 

Phe Lys Lys Leu Tyr Glu Pro He Val Thr Thr Thr Lys Arg Lys Glu 



1700 



1705 



1710 



Glu Glu Gin Gly Ala Ala Val He Gin Arg Ala Tyr Arg Lys His Met 



1715 



1720 



1725 



Glu Lys Met Val Lys Leu Arg Leu Lys Asp Arg Ser Ser Ser Ser His 



1730 



1735 



1740 



1750 I 755 



Gin Val Phe Cys Asn Gly Asp Leu Ser Ser Leu Asp Val Ala Lys Val 

1760 

1745 

Lys Val His Asn Asp 
1765 



n ft ' 



Figure 2G.SEQ ID N0:2 

1 MEERYYPVXF PDERNFRPFT SDSLAAXEKR IAIQKERKKS^ KDKAAAEPQP 

51 RPQLDLKASR KLPKLYGDIP PELVAKPL.ED LDPFYKDHKT FMVLNKKRTI 

101 YRFSAKRALF XL.GPFHP.RS LMXRXSVHSV F«cm ™ 

15l ERSFDNDIPE YVFXGXYX.E A^GF XVDEFSFLRB P— £ 

201 ^Jcw^^™^ SVI ™ 

25 1 WVMVL>TLFC ^CGPNPA SNKDCFEKEK 

301 ^"^"iSS^'iSiTIMP DNNYTKFDNF GWSFLAMFRV 
351 MTQDSWERLY RQXLRTSGXY FVFFFVWIF L G~T_I^MAYEE 
401 QNRNVAAETE AKEKMFQEAQ qIIrEEKEAL VAMGXDRSS. NSLQASSFSP 
451 KKRKFFGSKT RKSFFMRGSK TAQASASDSE DDASKNPQLL EQTKRLSQNL 
501 PVDLFDEHVD PLHRQRALSA VSXLTXTMQE QEKFQEPCFP CGKNLASKYL 
551 VWDCSPQWLC IKKVLRTIMT DPFTELMTI C 1 1 1 NTVFLA^ VEHHNMDDNL 
601 KTILKIGNWV FTGIF^C YkTxALDPYH YFRHGWNVF_D _S_XVAL,SLAD 
S51 ^^~i£^"^**a» TLNTLIKIIG HSVGALGNLT 
701 ^TXWFX; SVVGMRLFGT "kFNKTAY ATQ ERPRRRWHMD NFYHSFLWF 
751 ™S"MW G « SP^CXXVFVL X—VL _™LNS 
801 FSNEEKDGSL EGETRKTKVQ LALDRFRRAF SFMLHALQSF CCKKCMOOK 
851 PKPKETTESF AGENKDSILP DARPWKEYDT DMALYTGQAG APLAPLAEVE 
9 01 DDVEYCGEGG ALPTSQHSAG VQAGDLPPET KQLTSPDDQG VEMEVFSEED 
951 LHLSIQSPRK KSDAVSMLSE CSTIDLNDIF RNLQKTVSPK KQPDRCFPKG 
10 01 LSCHFLCHKT DkWsPWVLW WNIRKTCYQI VKH SW FESFX_ ^f_ G _ A 
10 51 LX~S RPQVEKLLRC TDNIFTFXF. LEMILKWVAF^ ^GFRRYFTS AW 

uoi cwUxvw s^Vs'Ik^^^ gmkvwyalx 

LVCLX^VfTilTg^FSG KFGRCINGTD INMYLDFTEV 
1201 M^'i'VS^ 5 "^™ ^™ EIMNAAVDSR 

>'dfeV nlyaylyfw fiifgsfftl nlfigvixdn FNQQQKKLGG 



— -1HS3 1 1 Rl 5 } 

1151 SAIPAILNVL 



1251 EKDEQPDFEA 

I IIIS6 



Figure 2H: SEQIDNO: 2 



13 01 QDIFMTEEQK KYYNAMKKLG TKKPQKPIPR PLNKCQAFVF DLVTSQVFDV 
135 1 IILGLIVLNM IIMMAESADQ PKDVKKTFDI 

140 1 RQ^TNG^'^DCvUvLS IISTLVSR^E DSDISFPPTL FRVVRLARIG 
1451 RILRLVRAAR "gVrVlLFALM "mSlVsLFNIG ^mAIP^ 
1501 VKKG S GXDDl"FN;ETFTGSM LCLFQITTSA GWDTLLNPML EAKEHCNSSS 
1551 QDSCQQPQIA WVFVSVIII SFLIWNMYI AVILENFNTA TEESEDPLGE 
16 01 ^l*^'^^™^** DALPEPLRVA KPNKFQFLVM 
16 51 DLPMVMGDRL HCMDVLFAFT TRVLGDSSGL DTMKTMMEEK FMEANPFKKL 
1701 YEPIVTTTKR KEEEQGAAVI QRAYRKHMEK MVKLSLKDRS SSSHQVFCNG 
1751 DLSSLDVAKV KVHND* 



/?_../... . e. .//. . Li- J - 



Figure 3 A: SEQ ED N0:3 
1 GCTGAGCAGT GGGGCACTGA TATTTGAAGA TGTTCACCTT GAGAACCAAC 

51 CCAAAATCCA AGAATTACTA AATTGTACTG ACATTATTTT TACACATATT 

101 TTTATCCTGG AGATGGTACT AAAATGGGTA GCCTTCGGAT TTGGAAAGTA 

151 TTTCACCAGT GCCTGGTGCT GCCTTGATTT CATCATTGTG ATTGTCTCTG 

201 TGACCACCCT CATTAACTTA ATGGAATTGA AGTCCTTCCG GACTCTACGA 

251 GCACTGAGGC CTCTTCGTGC GCTGTCCCAG TTTGAAGGAA TGAAGGTGGT 

301 GGTCAATGCT CTCATAGGTG CCATACCTGC CATTCTGAAT GTTTTGCTTG 

351 TCTGCCTCAT TTTCTGGCTC GTATTTTGTA TTCTGGGAGT ATACTTCTTT 

401 TCTGGAAAAT TTGGGAAATG CATTAATGGA ACAGACTCAG TTATAAATTA 

451 TACCATCATT ACAAATAAAA GTCAATGTGA AAGTGGCAAT TTCTCTTGGA 

501 TCAACCAGAA AGTCAACTTT GACAATGTGG GAAATGCTTA CCTCGCTCTG 

551 CTGCAAGTGG CAACATTTAA GGGCTGGATG GATATTATAT ATGCAGCTGT 

601 TGATTCCACA GAGAAAGAAC AACAGCCAGA GTTTGAGAGC AATTC AC TCG 

651 GTTACATTTA CTTCGTAGTC TTTATCATCT TTGGCTCATT CTTCACTCTG 

701 AATCTCTTCA TTGGCGTTAT CATTGACAAC TTCAACCAAC AGCAGAAAAA 

751 GTTAGGTGGC CAAGACATTT TTATGACAGA AGAACAGAAG AAATACTATA 

801 ATGCAATGAA AAAATTAGGA TCCAAAAAAC CTCAAAAACC CATTCCACGG 

851 CCCGTT 



Figure 3B:SEQ ID N0.3 



(Human PN5 is top line) 
(Rat PN5 is bottom line) 



1 Mill 

1001 LSCHFLCHKTDKRKSPVWLWWNIRKTCYQIVKHSWFESFIIFVILLSSGA 1050 

6 LIFBDVKLENQPKIQELraiFTHIFIL^^ " 

l05 i iiFEiUpsfeiLULiUFik^xiffi^UiiU uoo 
noi UU^U^sULsUiF^ipiiiiiUUlUUviii 115 

106 GAIPAIIaNVLLVCLIFWLVFCILGVYFFSGKFGKCING^ . SVINYTII 153 
H51 siiUiiUiUciiF^UlGLFUiFUcUclL^FiEV 120 

154 TOKSQCESS^SWIJWICVNF^^ 203 

1301 1250 
204 EKEQQPEFESNSLGYIYFVWIIFGSFFTLNLFIGVII^ 253 

1251 EKDEQPDFEANLYAYLYFVwilFG^ »0 

285 

254 QDIFMTEEQKKYYNAMKKLGSKKPQKPIPRPV 

1301 U^EE^Y^ 135 



0 



00 



0 

285 

0 



Co Jinn. V~'L. A- U I 



1 CTCAACATGG 

51 GACGAAGGTT 

101 GCGAGTGTGT 

151 GGCTGGAACG 

201 GCTGTTTCT 

251 TCTTCCGGGT 

301 CGAGCAGCCA 

351 GCCCGCCCTC 

401 ACTCCATCTT 

451 ATCGACGACA 

501 GTTCCAGATC 

551 TCAACACGGG 

601 TCCCGGGGGA 
651 CTACATCATC 
7 01 TC 



TTACGATGAT 
CTGGGCAGAA 
GATGAAGATG 
TGTTCGAcTT 
GCAATCCTTA 
CATCCGTCTG 
AGGGGATTCG 
TTCAACATCG 
CGGCATGGCC 
TGTTCAACTT 
ACCACCTCGG 
GCCTCCCTAC 
ACTGCGGGAG 
ATCTCCTTCC 



Figure 4: SEQ 
GGTGGAGACC 
TCAACCAGTT 
TTCGCCCTGC 
CATAGTGGTG 
AGTCACTGGA 
GCCAGGATCG 
CACGCTGCTC 
GCCTCCTCCT 
AGCTTCGCTA 
CAAGACCTTT 
CCGGCTGGGA 
TGCGACCCCA 
CCCGGCGGTG 
TCATCGTGGT 



EDNO:4 

GACGAGCAGG 

CTTTGTGGCC 

GACAGTACTA 

ATCCTGTCCA 

AAACTACTTC 

GCCGCATCCT 

TTCGCCCTCA 

CTTCCTCGtC 

ACGTCGTGGA 

GGCAACAGCA 

CGGCCTCCTC 

ACCTGCCCAA 

GGCATCATCT 

CAACATGTAT 



GCGAGGAGAA 

GTCTTCACGG 

TTTCACCAAC 

TTGGGAGTCT 

TCCCCGACGC 

CAGGCTGATC 

TGATGTCCCT 

ATGTTCATCT 

CGAGGCCGGC 

TGCTGTGCCT 

AGCCCCATCC 

CAGCAACGGC 

TCTTCACCAC 

ATCGCAGTCA 



U-J. 



Figure 5A: SEQIDNO: 5 
1 GTCGACTCTA GATCAGGGTG AAGMSGAGG AGAGGTACTA CCCGGTGATC 
51 TTCCCGGACG AGCGGAATTT CCGCCCCTTC ACTTCCGACT CTCTGGCTGC 
101 CATAGAGAAG CGGATTGCTA TCCAAAAGGA GAGGAAGAAG TCCAAAGACA 
151 AGGCGGCAGC TGAGCCCCAG CCTCGGCCTC AGCTTGACCT AAAGGCCTCC 
201 AGGAAGTTAC CTAAGCTTTA TGGTGACATT CCCCCTGAGC TTGTAGCGAA 
251 GCCTCTGGAA GACCTGGACC CATTCTACAA AGACCATAAG ACATTCATGG 
301 TGTTGAACAA GAAGAGAACA ATTTATCGCT TCAGCGCCAA GCGGGCCTTG 
351 TTCATTCTGG GGCCTTTTAA TCCCCTCAGA AGCTTAATGA TTCGTATCTC 
401 TGTCCATTCA GTCTTTAGCA TGTTCATCAT CTGCACGGTG ATCATCAACT 
451 GTATGTTCAT GGCGAATTCT ATGGAGAGAA GTTTCGACAA CGACATTCCC 
501 GAATACGTCT TCATTGGGAT TTATATTTTA GAAGCTGTGA TTAAAATATT 
551 GGCAAGAGGC TTCATTGTGG ATGAGTTTTC CTTCCTCCGA GATCCGTGGA 
601 ACTGGCTGGA CTTCATTGTC ATTGGAACAG CGATCGCAAC TTGTTTTCCG 
651 GGCAGCCAAG TCAATCTTTC AGCTCTTCGT ACCTTCCGAG TGTTCAGAGC 
701 TCTGAAGGCG ATTTCAGTTA TCTCAGGTCT GAAGGTCATC GTAGGTGCCC 
751 TGCTGCGCTC GGTGAAGAAG CTGGTAGACG TGATGGTCCT CACTCTCTTC 
801 TGCCTCAGCA TCTTTGCCCT GGTCGGTCAG CAGCTGTTCA TGGGAATTCT 
851 GAACCAGAAG TGTATTAAGC ACAACTGTGG CCCCAACCCT GCATCCAACA 
901 AGGATTGCTT TGAAAAGGAA AAAGATAGCG AAGACTTCAT AATGTGTGGT 
951 ACCTGGCTCG GCAGCAGACC CTGTCCCAAT GGTTCTACGT GCGATAAAAC 
1001 CACATTGAAC CCAGACAATA ATTATACAAA GTTTGACAAC TTTGGCTGGT 
1051 CCTTTCTCGC CATGTTCCGG GTTATGACTC AAGACTCCTG GGAGAGGCTT 
1101 TACCGACAGA TCCTGCGGAC CTCTGGGATC TACTTTGTCT TCTTCTTCGT 



Figure 5B: SEQ ID NO: 5 
ll5 l GGTGGTCATC TTCCTGGGCT CCTTCTACW GCTTAACCTA ACCCTGGCTG 
1201 TTGTCACCAT GGCTTATGAA GAACAGAACA GAAATGTAGC TCCTGAGACA 
1251 GAGGCCAAGG AGAAAATGTT TCAGGAAGCC CAGCAGCTGT TAAGGGAGGA 
1301 GAAGGAGGCT CTGGTTGCCA TCGGAATTGA CAGAAGTTCC CTTAATTCCC 
1351 TTCAAGCTTC ATCCTTTTCC CCGAAGAAGA GGAAGTTTTT CGGTAGTAAG 
1401 ACAAGAAAGT CCTTCTTTAT GAGAGGGTCC AAGACGGCCC AAGCCTCAGC 
1451 GTCTGATTCA GAGGACGATG CCTCTAAAAA TCCACAGCTC CTTGAGCAGA 
1501 CCAAACGACT GTCCCAGAAC TTGCCAGTGG ATCTCTTTGA TO AGCACGTG 
1551 GACCCCCTCC ACAGGCAGAG AGCGCTGAGC GCTGTCAGTA TCTTAACCAT 
1601 CACCATGCAG GAACAAGAAA AATTCCAGGA GCCTTGTTTC CCATGTGGGA 
1651 AAAATTTGGC CTCTAAGTAC CTGGTGTGGG ACTGTAGCCC TCAGTGGCTG 
1701 TGCATAAAGA AGGTCCTGCG GACCATCATG ACGGATCCCT TTACTGAGCT 
1,51 GGCCATCACC ATCTGCATCA TCATCAATAC CGTTTTCTTA GCCGK3GAGC 
1801 ACCACAACAT GGATGACAAC TTAAAGACCA TACTGAAAAT AGGAAACTGG 
1851 GTTTTCACGG GAATTTTCAT AGCGGAAATG TGTCTCAAGA TCATCGCGCT 
1901 CGACCCTTAC CACTACTTCC GGCACGGCTG GAATGTTTTT GACAGCATCG 
1951 TGGCCCTCCT GAGTCTCGCT GATGTGCTCT ACAACACACT GTCTGATAAC 
2001 AATAGGTCTT TCTTGGCTTC CCTCAGAGTG CTGAGGGTCT TCAAGTTAGC 
2051 CAAATCCTGG CCCACGTTAA ACACTCTCAT TAAGATCATC GGCCACTCCG 
2101 TGGGCGCGCT TGGAAACCTG ACTGTGGTCC TGACTATCGT GGTCTTCATC 
2151 TTTTCTGTGG TGGGCATCCG GCTCTTCGGC ACCAAGTTTA ACAAGACCGC 
2201 CTACGCCACC CAGGAGCGGC CCAGGCGGCG CTGGCACATG GATAATTTCT 
2251 ACCACTCCTT CCTGGTGGTG TTCCGCATCC TCTGTGGGGA ATGGATCGAG 
2301 AACATGTGGG GCTGCATGCA GGATATGGAC GGCTCCCCGT TGTGCATCAT 



Figure 5C: SEQ ID NO: 5 
2351 lOTCTTTGTC CTGATAATGG 

2401 TCATTGCCTT GC^AA, ^CAGCA -AGGAGAA ««« 
45 1 CTGGAAGGAG AGACCAGGAA AACCAAAGTG CAGC.AGCCC TGGATCGGTT 
5 01 CCGCCGGGCC ****** —ACGC TCTTCAGAGT ^CA 
2551 AGAAATGCAG GAGGAAAAAC TCGCCAAAGC CAAAAGAGAC AACAGAAAGC 
2601 AGAATAAAGA C T CAA T CC T C CCGGATGCGA GGCCCTGGAA 

265 1 GGAGTATGAT ACAGACATGG CT^ACAC TGGACAGGCC GGGGCTCCGC 
270 1 TGGCCCCACT CGCAGAGGTA GAGGACGATG TGGAATATTG TGGTGAAGGC 
2751 GG^CCCTAC CCACCTCACA ACATAGTC3CT GGAGTTCAGG CCGGTGACCT 

ml . rM . TCACTAGCCC GGATGAOCAA GGGGTTGAAA 
2801 CCCTCCAGAG ACCAAGCAGC TCAL1AUV. 

285 1 TGGAAGTATT TTCTGAAGAA GATCTGCATT TAAGCATACA GAGTCCTCGA 
2901 AAGAAGTCTG ACGCAGTGAG CATGCTCTCG GAATGCAGCA CAATTGACCT 
2951 GAATGATATC TTTAGAAATT TACAGAAAAC AGTTTCCCCC AAAAAGCAGC 

rrrCTTAGTT GTCACTTTCT ATCCCACAAA 
3001 CAGATAGATG CTTTCCCAAG GGCCTTAGT1 u 

305 1 ACAGACAAGA GAAAGTCCCC CTGGGTCCTG TGGTGGAACA TTCGGAAAAC 
3101 CTGCTACCAA ATCGTGAAGC ACAGCTGGTT .GAGAG^ ATAATCTTTG 
315 1 TTATTCTGCT GAGCAGTGGA GCGCTGATAT TTGAAGATGT CAATGTCCCC 
32 01 AGCCGGCCCC AAGTTGAGAA ATTACTAAGG TCTACCGATA ATATTTTCAC 

TPATCCTGAA GTGGGTGGCC TTTGGATTCC 
3251 ATTTATTTTC CTCCTGGAAA TGATCCTGAA * 

3301 GGAGGTATTT CACCAG^CC TGG^C ^TTCC, CATTGTGGTG 
2251 GTGTCTGTGC TCAGTCTCAT GAATCTACCA AGCTTGAAGT CC^CGGAC 
340 1 XC^CGGGCC CTGAGACCTC TGCGGGCGCT GTCCCAGTTT GAAGGAATGA 
3451 AGGTTGTCGT C.ACGCCCTG ATCAGCGCCA TACC^CCA, TCTCAATGTC 
350 1 TTGCTC3GTCT GCCTCATTTT CTGGCTCGTA TTTTGTATCT TGGGAGTAAA 



Figure 5D: SEQ ID NO: 5 
3551 TTTATTTTCT GGGAAGTTTG GAAGGTGCAT TAACGGGACA GACATAAATA 
3601 TGTATTTGGA TTTTACCGAA GTTCCGAACC GAAGCCAATG TAACATTAGT 
3651 AATTACTCGT GGAAGGTCCC GCAGGTCAAC TTTGACAACG TGGGGAATGC 
3701 CTATCTCGCC CTGCTGCAAG TGGCAACCTA TAAGGGCTGG CTGGAAATCA 
3751 TGAATGCTGC TGTCGATTCC AGAGAGAAAG ACGAGCAGCC GGACTTTGAG 
380! GCGAACCTCT ACGCGTATCT CTACTTTCKS GTTTTTATCA TCTTCGGCTC 
3S51 CTTCTTTACC CTGAACCTCT TTATCGGTGT TATTATTGAC AACTTCAATC 
3901 AGCAGCAGAA AAAGTTAGGT GGCCAAGACA TC.TTCATGAC JGASGASCAG 
3951 AAGAAATATT ACAATGCAAT GAAAAAGTTA GGAACCAAGA AACCTCAAAA 
400! GCCCATCCCA AGGCCCCTGA ACAAATCTCA AGCCTTTGTG TTCGACCTGG 
4051 TCACAAGCCA GGTCTTTGAC GTCATCATTC TGGGTCTTAT TGTCTTAAAT 
4101 ATGATTATCA TGATGGCTGA ATCTGCCGAC CAGCCCAAAG ATGTGAAGAA 
4151 AACCTTTGAT ATCCTCAACA TAGCCTTCGT GGTCATCTTT ACCATAGAGT 
4201 GTCTCATCAA AGTCTTTGCT TTGAGGCAAC ACTACTTCAC CAATGGCTGG 
4251 AACTTATTTG ATTGTGTGGT CGTGGTTCTT TCTATCATTA GTACCCTGGT 
4301 TTCCCGCTTG GAGGACAGTG ACATTTCTTT CCCGCCCACG CTCTTCAGAG 
4351 TCGTCCGCTT GGCTCGGATT GGTCGAATCC TCAGGCTGGT CCGGGCTGCC 
4401 CGGGGAATCA GGACCCTCCT CTTTGCTTTG ATGATGTCTC TCCCCTCTCT 
4451 CTTCAACATC GGTCTGCTGC TCTTCCTGGT GATGTTCATT TACGCCATCT 
4501 TIGGGATGAG CTGGTTTTCC AAAGTGAAGA AGGGCTCCGG GATCGACGAC 
4551 ATCTTCAACT TCGAGACCTT TACGGGCAGC ATGCTGTGCC TCTTCCAGAT 
4601 AACCACTTCG GCTGGCTGGG ATACCCTCCT CAACCCCATG CTGGAGGCAA 
4651 AAGAACAC1G CAACTCCTCC TCCCAAGACA GCTGTCAGCA GCCGCAGATA 
4701 GCCGTCGTCT ACTTCGTCAG TTACATCATC ATCTCCTTCC TCATCGTGGT 
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